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NOTATION 


VECTOR suffixes are denoted by Latin letters j, k, ... Spin indices are denoted by 
Greek letters «œ, 8, ... Summation is implied over all repeated indices. 

“A-vectors” (see the footnote to equation (13.8)) are denoted by capital letters 
Oe caer 

Volume element dV or d°x. 

Limit on tending to zero from above or below +0 or —0. 

Operators are denoted by a circumflex. 

Hamiltonian 1, H’ = Ĥ— uÑ. 

Perturbation operator V. 

y operators in the Schrödinger representation Y, pt; in the Heisenberg represen- 
tation Y, +: in the Matsubara representaion Y™, PM, 

Green’s functions G, D. 

Temperature Green’s functions G, D. 

Thermodynamic quantities are denoted as in Part 1, for example 7 temperature, 
V volume, P pressure, u chemical potential. 

Magnetic field H; magnetic induction B; external magnetic field Q. 

References to earlier volumes in the Course of Theoretical Physics: 

Mechanics = Vol. 1 (Mechanics, third English edition, 1976). 

Fields = Nol. 2 (The Classical Theory of Fields, fourth English edition, 1975). 

QM = Vol. 3 (Quantum Mechanics, third English edition, 1977). 

RQT = Vol. 4 (Relativistic Quantum Theory, first English edition, Part 1, 1971; 
Part 2, 1974). 

Part 1 = Vol. 5 (Statistical Physics, Part 1, third English edition, 1980). 

FM = Vol. 6 (Fluid Mechanics, first English edition, 1959). 

ECM = Vol. 8 (Electrodynamics of Continuous Media, first English edition, 1960). 

All are published by Pergamon Press. 


PREFACE 


As a brief characterization of its content, this ninth volume in the Course of 
Theoretical Physics may be said to deal with the quantum theory of the con- 
densed state of matter. It opens with a detailed exposition of the theory of 
Bose and Fermi quantum liquids. This theory, set up by L. D. Landau follow- 
ing the experimental discoveries by P. L. Kapitza, is now an independent 
branch of theoretical physics. Its importance is in fact measured not so much by 
even the remarkable phenomena that occur in the liquid isotopes of helium as by 
the fact that the concepts of a quantum liquid and its spectrum are essentially 
the foundation for the quantum description of macroscopic bodies. 

For example, a thorough understanding of the properties of metals involves 
treating the electrons in them as a Fermi liquid. The properties of the electron 
liquid are, however, complicated by the presence of the crystal lattice, and a 
study of the simpler case of a homogeneous isotropic liquid is a necessary pre- 
liminary step in the construction of the theory. Similarly, superconductivity in 
metals, which may be regarded as superfluidity of the electron liquid, is difficult 
to understand clearly without a previous knowledge of the simpler theory 
of superfluidity in a Bose liquid. 

The Green’s function approach is an indispensable part of the mathematical 
formalism of modern statistical physics. This is not only because of the con- 
venience of calculation of Green’s functions by the diagram technique, but 
particularly because the Green’s functions directly determine the spectrum of 
elementary excitations in the body, and therefore constitute the language that 
affords the most natural description of the properties of these excitations. In 
the present volume, therefore, considerable attention is paid to methodologi- 
cal problems in the theory of Green’s functions of macroscopic bodies. 
Although the basic ideas of the method are the same for all systems, the specific 
form of the diagram technique is different in different cases. It is consequently 
natural to develop these methods for the isotropic quantum liquids, where the 
essence of the procedure is seen in its purest form, without the complications 
arising from spatial inhomogeneity, the presence of more than one kind of 
particle, and so on. 

For similar reasons, the microscopic theory of superconductivity is described 
with the simple model of an isotropic Fermi gas with weak interaction, dis- 
regarding the complications due to the presence of the crystal lattice and the 
Coulomb interaction. 

In respect of the chapters dealing with electrons in the crystal lattice and 
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with the theory of magnetism, we must again stress that this book is part of a 
course of theoretical physics and in no way attempts to be a textbook of solid 
state theory. Accordingly, only the most general topics are discussed here, and 
no reference is made to problems that involve the use of specific experimental 
results, nor to methods of calculation that have no evident theoretical basis. 
Moreover, this volume does not include the transport properties of solids, 
with which we intend to deal in the next and final volume of the Course. 

Finally, this book also discusses the theory of electromagnetic fluctuations 
in material media and the theory of hydrodynamic fluctuations. The former was 
previously included in Volume 8, Electrodynamics of Continuous Media. Its 
transfer to the present volume is a consequence of the need to make use of 
Green’s functions, whereby the entire theory can be simplified and made more 
convenient for application. It is also more reasonable to treat electromagnetic 
and hydrodynamic fluctuations in the same volume. 

This is Volume 9 of the Course of Theoretical Physics (Part 1 of Statis- 
tical Physics being Volume 5). The logic of the arrangement is that the 
topics dealt with here are closely akin also to those in fluid mechanics 
(Volume 6) and macroscopic electrodynamics (Volume 8). 

L. D. Landau is not among those who have actually written this book. 
But the reader will quickly observe how often his name occurs in it: a consid- 
erable part of the results given here are due to him, alone or with his pupils 
and colleagues. Our many years’ association with him enables us to hope that 
we have accurately reflected his views on these subjects—while at the same 
time, of course, having regard to developments in the fifteen years since his 
work was so tragically terminated. 

We should like to express here our thanks to A. F. Andreev, I. E. Dzyalo- 
shinskii and I. M. Lifshitz for many discussions of topics in this book. We 
have had great benefit from the well-known book Quantum Field Theoretical 
Methods in Statistical Physics (Pergamon, Oxford, 1965) by A. A. Abrikosov, 
L. P. Gor’kov and I. E. Dzyaloshinskil, one of the first books in the literature 
of physics to deal with the new methods of statistical physics. Lastly, we are 
erateful to L. P. Gor’kov and Yu. L. Klimontovich for reading the book in 
manuscript and making a number of comments. 


April 1977 E. M. LirsHitz 
L. P. PITAEVSKI1 


CHAPTER I 


THE NORMAL FERMI LIQUID 


§ 1. Elementary excitations in a quantum Fermi liquid 


AT TEMPERATURES So low that the de Broglie wavelength corresponding to 
the thermal motion of the atoms in a liquid becomes comparable with the 
distances between the atoms, the macroscopic properties of the liquid are de- 
termined by quantum effects. The theory of such quantum liquids is of consid- 
erable fundamental interest, although there exist in Nature only two such 
that are literally liquids, the liquid isotopes of helium He’ and He‘ at tempera- 
tures ~ J-2°K. All other substances solidify well before quantum effects 
become important in them. In this connection, it may be recalled that according 
to classical mechanics all bodies should be solid at absolute zero (see Part 1, 
§64). Helium, however, because of the peculiarly weak interaction between 
its atoms, remains liquid down to temperatures where quantum phenomena 
come into effect, whereupon it need not solidify. 

The calculation of the thermodynamic quantities for a macroscopic body 
requires a knowledge of its energy level spectrum. In a system of strongly 
interacting particles such as a quantum liquid, we can refer, of course, only to 
levels that correspond to quantum-mechanical stationary states of the whole 
liquid, not to states of the individual atoms. In calculating the partition func- 
tion at sufficiently low temperatures, we are to take account only of the weakly 
excited energy levels of the liquid, lying fairly close to the ground state. 

The following point is of fundamental importance for the whole theory. 
Any weakly excited state of a macroscopic body may be regarded, in quantum 
mechanics, as an assembly of separate elementary excitations. These behave 
like quasi-particles moving in the volume occupied by the body and possessing 
definite energies e and momenta p. The form of the function e(p), the disper- 
sion relation for the elementary excitations, is an important characteristic of 
the energy spectrum of the body. It must again be emphasized that the con- 
cept of elementary excitations arises as a means of quantum-mechanical 
description of the collective motion of the atoms in a body, and the quasi- 
particles cannot be identified with the individual atoms or molecules. 

There are various types of energy spectrum that can in principle occur in 
quantum liquids. There will be completely different macroscopic properties 
also, depending on the type of spectrum. We shall begin by considering a liquid 
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with what may be called a Fermi spectrum. The theory of such a Fermi liquid 
is due to L. D. Landau (1956-1958); he derived the results given in §§1-4.7 
The energy spectrum of a Fermi quantum liquid has a structure which is to 
some extent similar to that of an ideal Fermi gas (of particles with spin +). 
The ground state of the latter corresponds to the occupation by particles of all 
the states within the Fermi sphere, a sphere in momentum space whose radius 
Pris related to the gas density N/V (number of particles per unit volume) by 


N/V = 2-4mp3/3(2nh)S 
= pp/30Ph' (1.1) 


see Part 1, §57. The excited states of the gas occur when the particles pass from 
states of the occupied sphere to some states with p > pp. 

In a liquid, of course, there are no quantum states for individual particles, 
but to construct the spectrum of a Fermi liquid we start from the assumption 
that the classification of energy levels remains unchanged when the interaction 
between the atoms is gradually “switched on”, i.e. as we go from the gas to the 
liquid. In this classification the role of the gas particles is taken by the elemen- 
tary excitations (quasi-particles), whose number is equal to the number of 
atoms and which obey Fermi statistics. 

It is evident that such a spectrum can occur only for a liquid of particles 
with half-integral spin: the state of a system of bosons (particles with integral 
spin) cannot be described in terms of quasi-particles obeying Fermi statistics. 
At the same time it must be emphasized that a spectrum of this type cannot be 
a universal property of all such liquids. The type of spectrum depends also on 
the specific nature of the interaction between atoms. This is clear from the 
following simple consideration: if the interaction is such that it causes the 
atoms to tend to associate in pairs, then in the limit we obtain a molecular 
liquid consisting of particles (molecules) with integral spin, for which the 
spectrum under consideration is certainly impossible. 

Each of the quasi-particles has a definite momentum p (we shall return later 
to the question of the validity of this assertion). Let n(p) be the momentum 
distribution function of the quasi-particles, normalized by the condition 


[nde = N/V, de = dph’; 


this condition will later be made more precise. The classification principle 
mentioned above consists in supposing that, if this function is specified, the 
energy E of the liquid is uniquely determined and that the ground state cor- 
responds to a distribution function in which all states are occupied within the 
Fermi sphere, whose radius pp is related to the density of the liquid by the 
same formula (1.1) as for an ideal gas. 


+ To anticipate, we may mention here for the avoidance of misunderstanding that we are 
referring to a non-superfluid (normal) Fermi liquid, such as is the liquid isotope He, with 
the reservation made in the third footnote to §54. 
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It is important to emphasize that the total energy E of the liquid is not simply 
the sum of the energies £ of the quasi-particles. In other words, E is a functional 
of the distribution function that does not reduce to the integral f ne dr (as it 
does for an ideal gas, where the quasi-particles are the same as the actual 
particles and do not interact). Since the primary concept is E, the question 
arises how the energy of the quasi-particles is to be defined, with allowance for 
their interaction. 

For this purpose, let us consider the change of E due to an infinitesimal 
change in the distribution function. It can manifestly be defined as the integral 
of an expression linear in the variation ôn, i.e. it has the form 


E/V = | e(p)ôn dr. 


The quantity « is the functional derivative of the energy E with respect to the 
distribution function. It corresponds to the change in the energy of the system 
when a single quasi-particle with momentum p is added. This quantity plays 
the role of the Hamiltonian function of a quasi-particle in the field of the other 
particles. It is also a functional of the distribution function, i.e. the form of the 
function (p) depends on the distribution of all the particles in the liquid. 

In this connection it may be noted that an elementary excitation in the type 
of spectrum considered may in a certain sense be treated like an atom in the 
self-consistent field of the other atoms. This self-consistency is, of course, not 
to be understood in the sense usual in quantum mechanics. Here its nature is 
more profound; in the Hamiltonian of the atom, not only is allowance made for 
the effect of the surrounding particles on the potential energy, but the depend- 
ence of the kinetic-energy operator on the momentum operator is also modified. 

Hitherto we have ignored the possible spin of the quasi-particles. Since spin 
is a quantum-mechanical quantity, it cannot be treated classically, and we must 
therefore regard the distribution function as a statistical matrix with respect 
to the spin. The energy e of an elementary excitation is in general not only a 
function of the momentum but also an operator with respect to the spin 
variables, which may be expressed in terms of the quasi-particle spin operator 
S. In a homogeneous isotropic liquid (not in a magnetic field and not ferromag- 
netic) the operator § can appear in the scalar function « only in the form of the 
scalars $? and (§.p)?; the first power of the product §.p is inadmissible, since the 
spin vector is an axial vector and this product is therefore a pseudoscalar. The 
square $? = s(s+1), and for spin s = + the scalar (8.p)? = +p? also reduces to 
a constant independent of §. Thus in this case the energy of a quasi-particle 
is independent of the spin operator, and all the energy levels of the quasi- 
particles are doubly degenerate. 

The statement that a quasi-particle has spin essentially expresses the fact 
that this degeneracy exists. In this sense we can say that the spin of the quasi- 
particles in a spectrum of the type considered is always +, whatever the spin 
of the actual particles in the liquid. For with any spin s other than + the terms 
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of the form (&.p)? would give a splitting of the (2s+1)-fold degenerate levels 
into +(2s+1) doubly degenerate levels. In other words, 4(2s+1) different 
branches of the function «(p) would appear, each corresponding to “quasi- 
particles with spin 4”. 

As already mentioned, when the spin of the quasi-particles is taken into 
account the distribution function becomes a matrix or an operator A(p) with 
respect to the spin variables. This operator may be explicitly written as an 
Hermitian statistical matrix n,,(p), where a and £ are spin matrix indices taking 
the two values +4. The diagonal matrix elements determine the numbers of 
quasi-particles in particular spin states. The normalization condition for the 
quasi-particle distribution function must therefore now be written 


tr fdr = fna dt = NIV, dr = dp/(2nh}, (1.2) 


where tr denotes the trace of the matrix with respect to the spin indices.‘ 
The quasi-particle energy ê is in general also an operator (a matrix with 
respect to the spin variables). It must be defined by 


OE|V = tr | ê ôñ dt = f cup dnp, dt. (1.3) 


If there is no spin dependence of the distribution function and the energy, 
so that n,, and ¢,, reduce to unit matrices: 


Nap = NÒags Exp = EOnp, (1.4) 


then the taking of the trace in (1.2) and (1.3) amounts to simply multiplying 
by 2: 

2{ndv=N/V, 6E/V = 2 f e ôn dr. (1.5) 

It is easy to see that in statistical equilibrium the quasi-particle distribution 

function is an ordinary Fermi distribution, the energy being represented by the 

quantity é defined in (1.3). For, because the energy levels of the liquid and of 


the ideal Fermi gas are classified in the same manner, the entropy S of the 
liquid is determined by a similar combinatorial expression 


S/V = —tr f {Alog A-(1—A) log (1—A)} dr (1.6) 


to that for a gas (Part 1, §55). Varying this expression with the additional con- 
ditions of constant total number of particles and constant total energy, 


SN/V = tr f õñdr = 0, SE/V = tr f è ôñ dt = 0, 
we obtain the required distribution: 
A= [eT +17}, (1.7) 


where u is the chemical potential of the liquid. 


+ Here and throughout, summation is as usual implied over repeated indices. 
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When the quasi-particle energy is independent of the spin, formula (1.7) 
signifies a similar relation between 7 and €: 


n = [e679T 41]. (1.8) 


At T = 0, the chemical potential is equal to the limiting energy on the surface 
of the Fermi sphere: 

[u]r=0 = €r = e(pr). (1.9) 
It must be emphasized that, despite the formal analogy between the expression 
(1.8) and the ordinary Fermi distribution, it is not identical with the latter: 
since « itself is a functional of n, formula (1.8) is strictly speaking a complicated. 
implicit expression for n. 

Let us now return to the assumption that a definite momentum can be 
assigned to each quasi-particle. The condition for this assumption to be valid 
is that the uncertainty in the momentum (due to the finite mean free path of the 
quasi-particle) should be small not only in comparison with the momentum 
itself but also in comparison with the width Ap of the “transitional zone” of 
the distribution, over which it differs appreciably from a step function:* 


O(p) = p)=1 for p< pr, (1.10). 
= 0 for p>pr. 


It is easy to see that this condition is satisfied if the distribution n(p) differs 
from (1.10) only in a small region near the surface of the Fermi sphere. For, 
by the Pauli principle, only quasi-particles in the transitional Zone of the distri- 
bution can undergo mutual scattering, and as a result of this scattering they 
must enter free states in that zone. Hence the collision probability is propor- 
tional to the square of the width of the zone. Accordingly, the uncertainty in the 
energy and hence that in the momentum of the quasi-particle are both propor- 
tional to (Ap). It is therefore clear that, when 4p is sufficiently small, the un- 
certainty in the momentum will be small in comparison not only with pp but 
also with Ap. 

Thus the method described is valid only for excited states of the liquid which 
are described by a quasi-particle distribution function differing from a step 
function in just a narrow region near the Fermi surface. In particular, for 
thermodynamic equilibrium distributions only sufficiently low temperatures are 
permissible. The (energy) width of the transitional zone of the equilibrium 
distribution is of the order of T. The quantum uncertainty in the energy of a 
quasi-particle, due to collisions, is of the order of #/t, where t is the mean free: 
time of the quasi-particle. The condition for the theory to be applicable is 


therefore 
h/t «T. (1.11) 


t For future reference, it may be noted that the derivative 6’(p) = — 6(p—p,), since both 
sides give unity on integration over any range of p that includes the point p = pp. 
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According to the preceding discussion, the time t is inversely proportional to | 
the squared width of the transitional zone: 


ToT. 


so that (1.11) is certainly satisfied as T — 0. For a liquid in which the inter- 
action between particles is not weak, all the energy parameters are of the same 
order as the limiting energy ep; in this sense, the condition (1.11) is equivalent 
to T « |«,|.7 

For almost step-function distributions (i.e. those close to the distribution 
for T = 0), as a first approximation we can replace the functional e by its value 
calculated with n(p) = 6(p). Then £ becomes a definite function of the magni- 
tude of the momentum, and (1.7) becomes the ordinary Fermi distribution. 

Near the surface of the Fermi sphere, where alone the function e(p) has a 
direct physical significance, it can then be expanded in powers of the differ- 
ence p— pr. We have 


E— Ep © Ur(P—PrF), (1.12) 


vr = [0¢/Op]p = pp (1.13) 


is the “velocity” of the quasi-particles on the Fermi surface. In an ideal Fermi 
gas, where the quasi-particles are identical with the actual particles, we have 
€ = p/2m, and so Up = p/m. By analogy we can define for a Fermi liquid 
the quantity 


where 


m* = pr/Ur, (1.14) 


called the effective mass of the quasi-particle; it is positive (see the end of 
§2). 

In terms of the quantities thus defined, the condition for the theory to be 
applicable may be written T « v,p,, and only quasi-particles with momenta 
p such that | p—p,| « pp have any real meaning. This important fact, in 
particular, makes the relation (1.1) between pp and the density of the liquid 
non-trivial, since its intuitive derivation (for a Fermi gas) is based on the 
concept of particles in states occupying the whole Fermi sphere, not just the 
neighbourhood of its surface.* 

The effective mass determines, in particular, the entropy S and the specific 
heat C of the liquid at low temperatures. These are given by the same formula 
as for an ideal gas (Part 1, §58), in which we need only replace the particle mass 
m by the effective mass m”: 


S=C = VyT, y = m'*ps/3h® = (ga)8(m*/)(N/V)¥8; (1.15) 


t For liquid He®, however, the range of quantitative applicability of the theory is shown 
by experiment to be in fact limited to T S$ 0.1 °K (whereas | €p] œ~ 2.5 °K). 

+ The proof of (1.1) involves the use of more complicated mathematical methods, and is 
given in §20 below. 
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because of the linear dependence on T, S and C are the same. This follows 
because the expression (1.6) for the entropy in terms of the distribution func- 
tion is the same for a liquid and for a gas, and in the calculation of this integral 
only the range of momenta near pp is important, in which the quasi-particle 
distribution function in the liquid and the particle distribution function in the 
gas are given by the same expression ( 1.8)." 

Before the theory is further developed, the following remark should be made. 
Although this method of defining the quasi-particles in a Fermi liquid by 
exact analogy with the particles in a gas is the most convenient in systematically 
deriving the theory, the corresponding physical picture has the disadvantage of 
involving the unobservable filled Fermi sphere of quasi-particles. This could be 
eliminated by a formulation in which the elementary excitations occur only 
when T = 0. In such a picture, the elementary excitations are represented by 
quasi-particles outside the Fermi sphere and “holes” within it; the former are 
to be assigned, in the approximation corresponding to (1.12), the energy 
€ = U,(p—Ppp), and the latter the energy € = vp(pr— p). The statistical distri- 
bution of each is given by the Fermi distribution formula with zero chemical 
potential (in accordance with the fact that the number of elementary excitations 
is here not constant, but is itself determined by the temperature)* 


n = felT+1]-}, (1.16) 


The elementary excitations in this picture appear or disappear only in pairs, 
and so the total numbers of excitations with p > pr and p < pp are always 
the same. 

With this definition of the elementary excitations, their energy is certainly 
positive, being the excess of the energy of the excited level over that of the 
ground level of the system. The energy of the quasi-particles defined by (1.3) 
may be either positive or negative. 

Moreover, for a liquid at zero temperature and zero pressure, the quantity 
ep = u is certainly negative, and the values of « close to ep are therefore 
negative also. This is clear, since, when T = 0 and P = 0, — u is a positive 
quantity, the limiting value of the heat of evaporation of the liquid per particle. 


t For liquid He? at zero pressure, p/f = 0.8X 108 cm™!; m* = 3.1 m (He?); pp is found 
from the density of the liquid, and m* from its specific heat. 

t It will be recalled (cf. Part 1, §63) that under such conditions the number of quasi-parti- 
cles N,p is determined by the condition for thermodynamic equilibrium: the free energy F is a 
minimum as a function of Nj» for given temperature and volume: (@F/ON,») r, y = 0. This 
derivative is, however, just the “chemical potential of the quasi-particles”; it should not be 
confused with the chemical potential u of the liquid, which is determined by the derivative 
of F with respect to the number of actual particles N. 
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§ 2. Interaction of quasi-particles 


The energy of the quasi-particles, being a functional of their distribution 
function, varies with that function. The change of energy for a small deviation 
ôn of the distribution function from the step function (1.10) must be 


Seop(P) = | fay, pol, P') Smte,(p’) dr’ (2.1) 
or, in a more symbolic form, 
da(p) = tr’ f F(p, p’) dA(p’) ar’, 


where tr’ denotes the trace with respect to the pair of spin indices that cor- 
respond to the momentum p’. The function f may be called the interaction 
function of the quasi-particles; in a Fermi gas, f = O. By definition, it represents 
the second variational derivative of the total energy E of the liquid, and is 
therefore symmetrical in the variables p, p’ and the corresponding pairs of 
spin indices: 

Say, BoP, P') = fya, op(P’s P). (2.2) 


With the change (2.1), the energy of the quasi-particles near the surface of 
the Fermi sphere is given by the sum 


&(p)—er = vr(p—pr)+tr’ f (p, p’) ôA’) ae’. (2.3) 


In particular, for thermodynamic equilibrium distributions, the second term 
in (2.3) gives the temperature dependence of the quasi-particle energy. The 
deviation 6/’ is appreciably different from zero only in a narrow band of p’ 
values near the surface of the Fermi sphere, and this contains the momenta 
p of actual quasi-particles. The function f (p, p’) in (2.1) and (2.3) can therefore 
be replaced in practice by its value on that surface, putting p = p’ = Ppr, 
so that f will depend only on the directions of the vectors p and p’. 

The spin dependence of the function f is due both to relativistic effects 
(spin-spin and spin-orbit interaction) and to the exchange interaction. The 
latter is the most important. When it is taken into account, the quasi-particle 
interaction function has (on the Fermi surface) the form 


(ppm Inh?) f (p, p') = F(8)+¢.0'G(6), (2.4) 
where o and o’ are the Pauli matrices acting on the corresponding spin indices 
(i.e. corresponding to the variables p and p’), and F and G are two functions 
of the angle ® between p and p'.t The form of this expression arises from a 
characteristic property of the exchange interaction, which is independent of the 
spatial orientation of the total angular momentum of the system, so that the 


t In explicit matrix form, 


(ppm*/2°h*) fay, ps = Fo .poya + Gap Oyô. (2.4a) 
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two spin operators can appear in it only as a scalar product. The functions 
F and G as defined by (2.4) are dimensionless. The factor separated for this 
purpose on the left of (2.4) is the number of quasi-particle states on the Fermi 
surface per unit energy interval: 


v(er) = [2dt/delemey = 


(2ntip \ de 


2-4p2 (dp 
(ze) 
Pr 


or 
vp = p27 op = prm* |k. (2.5) 


Since the trace of a Pauli matrix is zero, the second term in (2.4) vanishes 
when the trace tr’ is taken, and tr’ fis independent of o also. This in fact also 
happens when the spin-orbit and spin-spin interactions are taken into account. 
The reason is that the scalar function tr’ f could contain the spin operator only 
as the product §-p xp’ of the two axial vectors § and p xp’; expressions quad- 
ratic in the components of § need not be considered, since for spin 4 they reduce 
to terms linear in § or independent of §. But this product is not invariant under 
time reversal, and therefore cannot appear in the invariant quantity tr’ f. 

The following notation will be convenient: 


Sey, BP» P') = apf (p, P), S= ztrtr' f. (2.6) 
From the expression (2.4), we have 
(prm* |n?h) f (8) = 2F(8). (2.7) 


The quasi-particle interaction function satisfies a certain integral relation 
which follows from Galileo’s principle of relativity. A direct consequence of 
this principle is that the momentum of the liquid per unit volume is equal to 
its mass flux density. The velocity of a quasi-particle is 0e/Op, so that the quasi- 


particle flux is 
tr | n(Gé/Op) dr. 


Since the number of quasi-particles in the liquid is the same as the number of 
actual particles, it is clear that the total mass transfer by quasi-particles is 
found by multiplying their number flux by the actual particle mass m. Thus we 


obtain the equation 
tr | på dr = tr | m(0è/ðp)A dr. (2.8) 


Putting n, = NÒ,p» Eag = €6,, we vary both sides of (2.8), use (2.1), and 
take f from (2.6): 


fe ôn dt = m lex dem | U, ôn' dr dt’, 


Oe „n On’ ) 
= m | 3p ndm | fp atr dt dt’, 
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where n’ = n(p’); in the second integral, we have renamed the variables, and 
integrated by parts. Since ôn is arbitrary, this gives the required relation: 


p/m = ĉe/ðp— f f(p, p’) [On(p’)/Op'] av’. (2.9) 


For a step function n(p’) = @(p’), the derivative On'/@p’ reduces to a delta 
function: 
06(p)/Op = —(p/p) d(p—pr). (2.10) 


Substituting the function «(p) from (1.12) in (2.9), and then replacing the 
momentum p = pn everywhere by the value pp = pm on the Fermi surface, 
and multiplying both sides of the equation by pp, we get the following relation 
between the mass m of the actual particles and the effective mass of the quasi- 


particles: 


1 
Litaa | FØ) cos do’, (2.11) 


where do’ is the element of solid angle in the direction of p’. If we substitute 
here the expression (2.7) for /(9), this equation becomes 


m*/m = 1+ F(#) cos ð, (2.12) 


where the bar denotes averaging over directions, i.e. integration over do’ /4a = 
= 5 sind dô. 

Let us also calculate the compressibility of a Fermi liquid at absolute zero, 
i.e. the quantity u2 = OP/0¢.* The density of the liquid is ọ = mN/V, so that 


u? = —(V?/mN) OP/OV. 


To calculate this derivative, itis convenient to express it in terms of the deriv- 
ative of the chemical potential. Since the latter depends on N and V only 
through the ratio N/V, and for T = constant = 0 the differential du = VdP/N, 
we have 


i aren eee a pe 
oN NV WN? CV’ 
and hence 
N Ou 
Beet ee ge 
u? = m ON’ (2.13) 


Since 4s = e, for T = 0, the change ôu when the number of particles changes 
by ôN is 
ôu = Jf (pr, p’) ôn' de’ +(Ger/Opr) Spr. (2.14) 


t When T = 0, S = 0 also, and so there is no need to distinguish the isothermal and adia- 
batic compressibilities. The quantity u is defined by the usual expression for the velocity of 
sound in the liquid. It must be borne in mind, however, that at T = 0 ordinary sound in fact 
cannot be propagated in a Fermi liquid; see §4. 
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The first term here is the change of e(pp) due to the change in the distribution 
function. The second term occurs because a change in the total number of 
panicle also affects the value of the limiting momentum: from (1.1), 6N = 
= Vp-6p,/7?h3. Since ôn’ is appreciably different from zero only when p’ ~ pr, 
we can write, replacing fin the integral by its value on the Fermi surface, 


ee f ,2d' _ 1 ôN 
[ fèn dt es, [ja fon tn = 3 Var: 


Substituting this expression in (2.14) and putting 0¢,/Op; = prim", we obtain 
On f mR 


aN = yt ppm Y : (2.15) 
Finally, with 1/m* from (2.11) and again using (1.1), we have 
2 _ Pe l pe 
w= PE ha a) | EO (1—cos 8) do’. (2.16) 


With /(8) from (2.7), and using (2.12), we can put this expression in the form 


a 


t= 





s+ FO). (2.17) 


The function f must satisfy certain conditions that result from the require- 
ment of stability of the ground state of the liquid. This state corresponds to 
occupation of all quasi-particle states within the Fermi sphere, and its energy 
must be a minimum with respect to any small deformation of the sphere. We 
shall not give the calculations in full, but only the final result, which may be 
conveniently expressed by expanding the functions F(#) and G(#) from (2.4) in 
Legendre polynomials: 


F(#) = § (21+ 1) FiP(cos 8), G(8) = F (2/+1)G;P,(cos 8); (2.18) 
l l 


with this definition, the coefficients F,and G, are the mean values of the prod- 
ucts FP, and GP,. Then the stability conditions are the inequalities 
Fi+ 1 > 0, (2.19) 
Gı+1 > 0. (2.20) 


A comparison of (2.19) for / = 1 with the expression (2.12) for the effective 
mass shows that the latter is positive. The condition (2.19) for / = 0 ensures 
that (2.17) is positive.* 


t See I. Ya. Pomeranchuk, Soviet Physics JETP 8, 361, 1959. 


t For / = 1, we also have the inequality F, > G,, as shown by A. J. Leggett, Annals of 
Physics 46, 76, 1968. 
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§ 3. Magnetic susceptibility of a Fermi liquid 


A quasi-particle with non-zero spin has in general a magnetic moment also. 
For spin +, the operator of this moment is So (the z-component of the magnetic 
moment is +f). The constant 28/4 which gives the ratio of the magnetic moment 
of the quasi-particle to its angular momentum 44 is equal to the corresponding 
constant for actual particles; clearly the value of this ratio is unchanged, which- 
ever way the particle spins are added to the quasi-particle spin. 

The existence of the magnetic moment of a quasi-particle leads in turn to 
to a paramagnetism of the liquid. The corresponding magnetic susceptibility 
may be calculated as follows. 

For a “free” quasi-particle, the operator of its additional energy in a mag- 
netic field H would be —foe.H. In a Fermi liquid, however, we must take it 
into account:that the interaction of the quasi-particles causes the energy of 
each of them to change, because of the changed distribution function in the 
magnetic field. In calculating the magnetic susceptibility, we must therefore 
write the quasi-particle energy change operator as 


dé = —Bo.H+tr’ f fon’ dr’. (3.1) 


The change in the distribution function is given in terms of ôè by 64 = (On/0e) 
6@;' we thus have 


6&(p) = —fo.H+tr’ f f(p, p’) (dn'/de’) 5é(p’) dt’. (3.2) 


We shall need the solution of this equation only on the surface of the Fermi 
sphere, and seek it in the form 


6é = —+fee.H, (3.3) 

where g is a constant. For a step function n(p’) = 9(p’), we have 
dn'/de’ = —6(e’—€r), 
so that the integration over dp’ = de’/v, reduces to taking the value of the 
integrand on the Fermi surface. Substituting f from (2.4) and noting that the 
Pauli matrices satisfy 
tra =0, tr(¢.0’)o’ = 40 tr’ o’.0’ = 20, 
we find 
g = 2—gG(0), 

or 

g = 2/01 +G), (3.4) 


where the bar again denotes averaging over directions, as in (2.12). 


+ In calculating the field-dependent increment 6n, we may neglect the change in the chemi- 
cal potential. The change in the macroscopic quantity u in an isotropic liquid can only be 
quadratic in the field H (which is assumed to be small in the calculation of the susceptibility), 
whereas 6€ is of the first order in the field. Since the magnetic susceptibility of the liquid is 
small, we need not distinguish between the field and the induction in it. 
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The susceptibility y is determined from the expression for the magnetic 
moment per unit volume of the liquid: 


ZH = ftr f o ôñ dr = ftr f oôê(ôn/ 3E) dr 


or, after integration with the step function n(p), 


m* a 
4H = —8 ae tr 0dé( pr). 





Finally, substituting (3.3) and (3.4), and noting that tr(¢.H)o = 2H, we find 


Sprm vB 
X= PUO U+’ 3.) 


where y is the coefficient in the linear specific heat law (1.15). The expression 
4 = 3yr? gives the susceptibility of a denegerate Fermi gas of particles with 
magnetic moment f; see Part 1, (59.5). The factor 1/(1+G) represents the 
difference between a Fermi liquid and a Fermi gas.‘ 

The stability condition (2.20) with / = 0 is the same as the condition y > 0. 


§ 4. Zero sound 


Non-equilibrium states of a Fermi liquid are described by quasi-particle 
distribution functions that depend not only on the momenta but also on the 
coordinates and time. These functions A(p, r, t) satisfy a transport equation 


dn/dt = (A), (4.1) 


where I(A) is the collision integral, giving the change in the number of quasi- 
particles in a given element of phase volume because of collisions between 
them.* 

The total time derivative in (4.1) includes both the explicit dependence of 
ñ on t and the implicit dependence due to the change in the coordinates, mo- 
mentum and spin variables of the quasi-particle in accordance with its equations 
of motion. The distinctive feature of the Fermi liquid is that, since the quasi- 
particle energy is a functional of the distribution function, in an inhomoge- 
neous liquid, £ as well as A depends on the coordinates. 

For distributions A that differ only slightly from the equilibrium distribution 
No, we write 

A(p, £, t) = no(p)+ dA(p, r, t). (4.2) 


t For He’, G = — 2/3. 

+ This section assumes familiarity with the transport equation and in that respect goes 
outside the scope of the book. However, the theory of Fermi liquids would be incompletely 
formulated without the transport equation (and its application in 884 and 5). We shall here 
need only the equation without the collision integral; problems involving the specific form of 
that integral will be discussed in another volume which deals with physical kinetics. 
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The quasi-particle energy is then ê = o+ ôĝê, where £o is the energy correspond- 
ing to the equilibrium distribution, and 6é is given by (2.1), so that 
oê 06é 


ñl 
CLL | fo. P ae) ae (4.3) 


v 


> = 
If there is no external magnetic field, £o and no are independent of the spin. 
The explicit time-dependence of f gives a term in dñ/dt 


On/Ot = 06/0. 
The dependence through the coordinates and momentum gives terms 


ALI esc 

a’ ap’ p- 
The quasi-particle energy é plays the role of the Hamiltonian. From Hamilton’s 
equations, 


? = d¢/Op, $= —dé/or. 
Hence we have, as far as the terms of the first order in 6%, 


OÂ O& Ono O0é 

Or’ Op Op’ Or’ 
Finally, the time variation of the function # as an operator with regard to the 
spin variables is given, according to the general rules of quantum mechanics, 
by the commutator 


(i/h) [é, A]. (4.4) 


However, when no and zo are independent of the spin, there are no terms of 
the first order in 6/ in this commutator. 
Collecting the various terms, we obtain the equation 


oñ Oco Oô OE Ono 
TE Oe OF on 
Before going on to apply the transport equation, let us discuss the conditions 
for it to be valid. By using the equations classical with regard to coordinates 
and momentum, we have assumed the motion of the quasi-particles to be quasi- 
classical; essentially the same assumption already underlies the description of 
the liquid by a distribution function that depends on both the coordinates and 
the momenta of the quasi-particles. The condition for quasi-classical motion is 
that the quasi-particle de Broglie wavelength fi/p,; be small compared with the 
characteristic length L over which n varies considerably. Using instead of L 
the “wave number” of the inhomogeneity, k ~ 1/L, we can write this condition 
ast 


= (â). (4.5) 


hk < pr. (4.6) 


t According to the definition (1.1), 4/pp is of the order of the interatomic distances, so that 
the condition (4.6) is very weak. 
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The frequency œ of the distribution function variation that is established for 
a given k is of the order of vpk, and automatically satisfies the condition 


hw < EF. (4.7) 


There may be any relation between fiw and the temperature T. If fiw >T, 
the width of the transitional zone of the distribution function is fiw; then 
(4.7) is the condition necessary for the entire theory to be valid, ensuring that 
the quantum uncertainty in the quasi-particle energy (due to their collisions) 
is small compared with fw. 

Let us now apply the transport equation to investigate vibrational motion in 
a Fermi liquid. 

At low (but non-zero) temperatures, collisions occur between quasi-particles 
in a Fermi liquid, and the mean free time t oc T-?. The nature of the waves 
propagated in the liquid essentially depends on the value of wt. 

When wt < 1 (which is effectively the condition for the quasi-particle mean 
free path / to be small compared with the wavelength 2), the collisions are able 
to bring about thermodynamic equilibrium in each volume element (small 
compared with A) in the liquid. This means that we have ordinary hydrodynam- 
ical sound waves propagated with velocity u = +/(@P/0e). The absorption of 
sound waves is small when wt <1, but increases with wt, and for wt ~ 1 
becomes very strong, so that the propagation of sound waves becomes im- 
possible. 

When ot increases further to wt >> 1, wave propagation again becomes 
possible in the Fermi liquid, but the waves have a different physical character. 
In these vibrations, collisions of quasi-particles are unimportant, and thermo- 
dynamic equilibrium is not established in each volume element. The process 
may be regarded as occurring at absolute zero of temperature. These waves 
are called zero sound. 

According to the above discussion, the collision integral in the transport 
equation can be omitted when wr >> 1; then 


5A, 5A Ano OE _ 


Of ae” op oe S or) 


where v = 0e/Op is the quasi-particle velocity calculated from the unperturbed 
energy €(v = vmn, where n isa unit vector in the direction of p); the suffix 
‘0 is omitted from e here and henceforward. 

When T = 0, the equilibrium distribution function no is a step function 
O(p) cut off at the limiting momentum p = pr. Its derivative is 


Ono/Op = —nd(p—pr) = —vo(e— er). 


t When wr < 1, the sound absorption coefficient y ~ ?7/gu°, where 7 is the viscosity of 
the liquid. In order of magnitude, u ~ vp, 7/0 ~ vpl ~ viet, where vp is the quasi-particle 
velocity (independent of the temperature), so that 7 x T~? (1. Ya. Pomeranchuk 1950). Then 
yujw ~ wt € wT”. 
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Assuming that the time and coordinate dependence of ôñ in the wave is 
given by the factor exp[i(k.r—ot)], we shall seek the solution of the transport 
equation in the form 


ñ = S(e— ep) 9(n) efkr—on) , (4.9) 


Then (4.8), with 06é/Cr from (4.3), becomes 


2 
(œ — vrn.k)?(n) = n.k aay tr’ f f(n, n')9(n’) do, (4.10) 
where n and n’ are unit vectors in the directions of p and p’, and the integration 
is over the directions of n’. 

Let us consider (zero sound) vibrations which do not affect the spin prop- 
erties of the liquid. This means that both the equilibrium distribution func- 
tion and also its “perturbation” 6n are independent of the spin variables. 
In such a wave, the change in the distribution function during the vibrations 
amounts to a deformation of the limiting Fermi surface (a sphere in the un- 
perturbed distribution), which remains a sharp boundary between the occupied 
and unoccupied quasi-particle states. The function »(n) is the displacement 
(in units of energy) of this surface in a given direction n. 

Since v(n’) is independent of the spin variables, the operation tr’ in (4.10) 
applies only to f. Writing fin the form (2.4), we have tr’ f = (27h3/p,m"*) F(8). 
Thus the operator o no longer appears in the equation, which now becomes 


(@—k.y) x(n) = k.v f F(8) x(n’) do’ /4x. (4.11) 


We take the direction of k as the polar axis, and define the direction of n 
by angles @ and @. Introducing the wave propagation velocity up = w/k and 
the notation $ = uo/v,, we can write the equation in the final form 


(s—cos 8) (8, p) = cos 8 f F(8) »(8’, $’) do’/4z. (4.12) 


This integral equation determines, in principle, the wave propagation veloci- 
ty and the function »(n’) in the waves. We see at once that, for undamped 
vibrations (the only ones considered here), s must exceed unity, i.e. 


Uo > Uf. (4.13) 
The origin of this inequality can be understood if we rewrite (4.12) as 


HO, p) do’ 
s—cos 9 4n’? 


5(8, $) = cos 0 [r® 
where v has been replaced by another unknown function % = (s—cos 6)». 
When s = w/kv,; < 1, the integrand has a pole at cos 6’ = s, and in order 
to make the integral meaningful this pole in the plane of the complex variable 
cos 9’ must be avoided by some definite rule. This adds an imaginary part to 
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the integral; the frequency w therefore also acquires an imaginary part (for 


a given real k), and the wave is damped. The physical significance of the equa- 
tion cos 0 = to/vr, corresponding to the pole, is that this is the condition for 


the quasi-particles to emit Cherenkov waves of zero sound.t 

As an example, let us consider the case where F(#) is a constant Fo. The in- 
tegral on the right of (4.12) is then independent of the angles 6 and ¢, and the 
required function » is therefore 


0 
v = constant x 2S (4.14) 


S—cos 0 ` 
The Fermi surface thus becomes a surface of revolution elongated in the for- 
ward direction of wave propagation and flattened in the opposite direction. 
This anisotropy is a consequence of the non-equilibrium state of the liquid in 
each of its volume elements: in equilibrium, all properties of the liquid must be 
isotropic, and the Fermi surface must therefore be spherical. For comparison, 
it may be mentioned that an ordinary sound wave corresponds to a spherical 
Fermi surface with oscillating radius (the limiting momentum p,; varies with 
the density of the liquid), shifted as a whole by an amount depending on the 
velocity of the liquid in the wave; the corresponding function v is v = ôpp+ 
+ constant xcos 6. 

To find the zero sound wave propagation velocity uo, we substitute (4.14) 
in (4.12): 


(a 2x sin 6 d0 i 
0 ee os 


s—cos 0 4n 
0 


On integrating, we get an equation which implicitly determines uo for a given 
value of Fo: 

1 s+1 
=> — -l1 = . 15 
5 Slog 5-1 = I/Fo (4.15) 


The function on the left decreases from infinity to zero when s varies from 1 to 
co, and is always positive. Hence it follows that the waves concerned can exist 
only when Fo > 0. It should be emphasized that the possibility of propagation 
of Zero sound thus depends on the properties of the interaction of the quasi- 
particles in the Fermi liquid. 

When Fo — 0, (4.15) shows that s tends to unity: 


s-1 = 4 e-2IF (4.16) 


t This is called Landau damping; it will be discussed in detail in connection with plasma os- 
cillations, in the last volume of the course. The rule for avoiding the pole in the integral is given 
by replacing w by w+ 70 (i.e. s + s+ i0); this signifies that the perturbation is made finite at all 
previous times (including 1 — — co). 
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This case has more general significance than (4.15) (in which it is assumed that 
F is a constant = Fo): it corresponds to zero sound in an almost ideal Fermi 
gas for any function F(#). An almost ideal gas has F(#) small in magnitude. 
It is seen from (4.12) that s is then close to 1, and v is appreciably different 
from zero only for small angles 6. Hence, considering only the range of small 
angles, we can replace F(#) on the right of (4.12) by its value when # = 0 
(which corresponds to 6 = 9’ = 0). We then return to (4.14) and (4.16), with 
the constant Fo replaced by F(0).* In a slightly non-ideal gas, the velocity of 
zero sound exceeds that of ordinary sound by a factor 4/3: for the former 
Mo © Up, and for the latter (2.17) gives (with F neglected and m ~ m*) u = 
x p}/3m*? = v;/3. 

In the general case of an arbitrary function F(#), the solution of (4.12) is 
not unique. The equation, in principle, allows the existence of various types 
of zero sound differing in the angular dependence of the amplitude »(9, ¢) and 
propagated at various velocities. As well as the axially symmetrical solutions 
(8), there can also exist asymmetric solutions in which v contains azimuthal 
factors e*’"* with integral m (see Problem). For all such solutions, the inte- 
gral | v do = 0,1.e. the volume within the Fermi surface is fixed. This means 
that the vibrations do not alter the density of the liquid. 

The possibility of wave propagation in a Fermi liquid at absolute zero im- 
plies that its energy spectrum may contain a branch corresponding to elemen- 
tary excitations with momentum p = Åk and energy ¢ = fiw = uop, which are 
“quanta of zero sound”. The fact that zero sound (with any k) can have an 
arbitrary (small) intensity means, in terms of the elementary excitations, that 
these can occupy their quantum states in any numbers; that is, they obey 
Bose statistics and form what is called the Bose branch of the spectrum of the 
Fermi liquid. It must be stressed, however, that in the Landau theory it would 
be improper to apply the corrections, corresponding to this branch, to the 
thermodynamic quantities for the Fermi liquid, since these contain higher 
powers of the temperature (T? in the specific heat) than even the first corrections 
to the approximate theory given above. 

The problem of the absorption of zero sound requires a consideration of the 
collisions of quasi-particles, and is outside the scope of this book. 


PROBLEM 
Find the velocity of propagation of asymmetric waves of zero sound when F= F, + F, cos 0. 


SOLUTION. When 


F = Fa F,[cos 6 cos 8’ + sin 6 sin & cos(’ -@)], 


t Vibrations corresponding to zero sound in a slightly non-ideal Fermi gas were first 
discussed by Yu. L. Klimontovich and V. P. Silin (1952). 
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there can exist solutions with voce+: putting v = /(A)e%, substituting in (4.12) and aiearaing 
over d’, we obtain 
T 
(s—cos 9) f = 1 F, cos 9 sin 0 f sin? OAO) de. 
0 
Hence 
sin ĝ cos , b 
-e e š 


y = constant x 
s—cos 9 


Substituting this expression back into the equation, we get 


~“ sin? 6 cos 0 
f — dl = 4/F,, 


s— cos 0 


which gives the dependence of the velocity of propagation on F}. The integral on the left is 
a monotonically decreasing function of s. Its greatest value therefore occurs when s = 1. Cal- 
culating the integral for s = 1, we find that an asymmetric wave of the type considered can be 
propagated if F, > 6.T 


§ 5. Spin waves in a Fermi liquid 


As well as the spin-independent solutions »(n) considered in §4, (4.10) has so- 
lutions of the form 
> = o.u(n), (5.1) 


in which the variation of the quasi-particle distribution function depends on 
the spin component. These may be called spin waves. 

Substituting (5.1) in (4.10), again taking fin the form (2.4), and noting that 
tr’ o'(o.0’) = 26, we get (after cancelling ©) 


(s—cos 6) u(0, 6) = cos 0 | GC) u(6", ¢') do’/4z. (5.2) 


Thus, for each component of the vector u, we get an equation that differs 
from (4.12) only in that F is replaced by G. Hence the subsequent calculations 
in §4 are applicable to spin waves also.* 

Spin waves of another kind can be propagated in a Fermi liquid when a 
magnetic field is present (V. P. Silin 1958). Here we shall consider only vibra- 
tions with k = 0, in which 6/ is independent of the coordinates. 

When a magnetic field H is present, even the quasi-particle energy and distri- 
bution function “unperturbed” by the vibrations are spin-dependent. These 
dependences are interrelated, and are given by (see §3) 


ĉo = eo(p)—fis-H, fi = £/(1+G), (5.3) 
Ao = no(p)— (dno/de) B1¢.H 
= no(p)+ ô(e— EF) ĝı0.H, (5.4) 


t For liquid He?, Fy and F; can be calculated from the known values of m* and u? by means 
of (2.12) and (2.17): Fy = 10.8, Fy = 6.3 (at zero pressure). 


* In liquid He?, Gy = G(®) <0; see the second footnote to §3. Such waves therefore cannot 
be propagated in it. 
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where zo(p) is the energy in the absence of the field; the suffix 0 again indi- 
cates that these expressions relate to the equilibrium liquid. 
We again seek the small variable part of the distribution function in the 
wave in the form 
ñ = 8(e— er) o.(mje~. 


The corresponding change in the quasi-particle energy is 
do’ __. 
o— , —iwt 
6é = o. | u(r \GA) me 


In the transport equation, we must now take into account the term (4.4) 
containing the commutator [é, A]; for distributions independent of the coor- 
dinates, it becomes 


Leie nh] = 0. (5.5) 
As far as terms linear in 6/ we have 
[é, A] = —filo.H, 64]+816(e— er) (62, o.H]. 
The commutators are given by the formula 


[o.a, o.b] = 2io.a xb, 


where a and b are any vectors; see QM (55.10). The transport equation thus 
becomes 


ioy(n) = (281/4) H xe(n), (5.6) 
p(n) = w(n)+ f a(n’) G(8) do’/4z. (5.7) 


In the general case, the solution of (5.6) can be expanded as a series of spher- 
ical harmonics Y,„(0, $), with the polar axis along H. Each term in the expan- 
sion represents a particular type of vibration with its frequency wm 

The first frequency woo corresponds to vibrations with p. = constant; then 
e = w(1+G), and (5.6) becomes 


iwo = (26/AH xy; 


where 


the vibrations are transverse to the field (u LH). Writing the equation in 
components in the plane perpendicular to H and taking the determinant, we 


find the frequency 
Moo = 2BH/h. (5.8) 


Here £ is the magnetic moment of a particle (actual) in the liquid. Thus woo is 
independent of the specific properties of the liquid. The values of all the other 
frequencies w, however, depend on the specific form of the function G(8). 
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§ 6. A degenerate almost ideal Fermi gas with repulsion between 
the particles 


The problem of the thermodynamic properties of an “almost ideal” degenerate 
gas has no direct physical significance, since the gases that actually exist in 
Nature condense at temperatures near absolute zero. Nevertheless, in view of 
the considerable methodological interest of this problem, there is value in 
discussing it for a hypothetical model of a gas whose particles interact in such 
a way that the gas cannot condense. 

The condition for the gas to be almost ideal is that the range ro of the molec- 
ular forces be small compared with the mean distance / ~(V/N)”* between 
the particles. As well as the condition ro « /, the inequality 


proh «1 (6.1) 


is valid for the particle momenta p: in a degenerate Fermi gas, the limiting 
momentum pp is estimated from (1.1), which gives pp/ň ~ (N/V)"3 « 1/ro. 

We shall consider here only a pair interaction between particles, and assume 
for simplicity that the interaction U(r) is independent of the particle spins. 
Our aim is to calculate the leading terms in the expansion of the thermody- 
namic quantities in powers of the ratio ro//, by means of quantum-mechanical 
perturbation theory. The difficulty is that, because of the rapid increase of the 
interaction energy at small distances between the particles, perturbation theory 
(the “Born approximation”) is in fact not applicable to particle collisions. 
This difficulty can, however, be circumvented in the following way. 

In the limiting case of “slow” collisions (as for instance when the condition 
(6.1) holds), the mutual scattering amplitude of particles with mass m tends to 
a constant limit —a, which in the Born approximation (see QM, (126.13)) is 


—a = —mU%/4ah®, Uo = [U(r) dx; (6.2) 


this limit corresponds to the s state of the pair of particles (with spin +). The 
constant a is called the scattering length.‘ Since this quantity entirely deter- 
mines the properties of the collisions, it must also determine the thermodynamic 
properties of the gas. 

This leads to the possibility of applying a procedure known as renormalization. 
We formally replace the true energy U(r) by a different function having the 
same value of a but such that perturbation theory can be used. So long as 
(i.e. in an approximation such that) the final result of the calculations contains 
U only in the scattering amplitude, it will be the same as the result that would 
be given by the actual interaction. 


t The expression (6.2) takes no account of the quantum-mechanical identity of the particles. 
In the limit of slow collisions of identical spin-3 particles, scattering occurs only for anti- 
parallel spins, and the differential cross-section for scattering into the solid angle do (in the 
centre-of-mass system) is do = 4a” do; the total cross-section is obtained by integrating do 
over a hemisphere, and iso = 87a" (see QM, §137). 
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The range of the actual interaction is in general the same in order of magni- 
tude as the scattering length a. For the fictitious field U(r) which serves an auxi- 
liary purpose, the condition for the Born approximation to be valid is a < ro. 
The actual small expansion parameter in this theory is, of course, ap,/h. 

We shall need the relation between Uo and a not only in the first approxi- 
mation (6.2), but also in the second Born approximation. To find this, we re- 
call that, if the transition probability of the system under the action of a con- 
stant perturbation V is given in the first approximation by the matrix element 
Voo, then in the second approximation Foo is replaced by 


where the summation is over the states (with n = 0) of the unperturbed system 
(see OM, §43). In the present case we have a system of two colliding particles, 
and the perturbation is their interaction U(r). The perturbation matrix ele- 
ments for transitions in which the particle momenta pı and pz become p; 
and p, (with pit+pe = py+p,) are 


1 
(Pian, pia | U| pita, Bats) = yy | U(r) e- Heath dèx, (6.3) 
where p = p,—p2 = —(pi— pı); since the interaction is independent of thé 


spins, the particle spin components a and a, are unaltered by the collision. 
The matrix element for zero momenta Uo/V plays the role of Voo. Thus, in 
changing from the first to the second approximation, we must replace Uo by 


1 | [Pi +p] cia dinates 
Yor yd a aa | f Ver foam ax |?; 


the summation is for given p; and ps, over p; ¥ Pi, Pe. Since in our case the 
particle momenta are assumed small, in all the important terms in the sum we 


can replace the matrix elements by their values at p = 0. We then get the 
following expression for the scattering length:t 


m U2 2m 
a eee ieee 6.4 
l = and | Yor V 2 ATA PTA ee 


Hence, with the same accuracy, 





2 2 
2 na [ Anh2a 2m | l (6.5) 


Uy = 1-—- F = 
° o mV Lt ph pepe 


t In all the intermediate formulae we write the sums over discrete values of the particle 
momenta with the particles in a finite volume V; in the final calculation the summation is 
replaced, as usual, by integration over Vd°p/(27h)°. 
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The divergence of the sum in (6.4) for large p; and p, is due to the replace- 
ment of all matrix elements by constants and is not important, since, when 
this expression is later used to calculate the energy of the system, a convergent 
expression is still obtained, in which large momenta are not significant. We 
take a to be the scattering length of slow particles, which is independent of 
their energy. Formula (6.4) seems at first sight to depend on the momenta pı 
and pz, but in fact this dependence is restricted to the imaginary part of the 
scattering amplitude (which exists when the method of summation is appro- 
priately defined; cf. QM, (130.9)), which need not be considered, since we know 
that the final result will be real. This topic will be resumed in §21. 

In the present section, we shall consider the model of a Fermi gas with a 
repulsive interaction between the particles; for such an interaction, a > 0. In 
this case, the gas has an energy spectrum of the Fermi type described in §§1 
and 2. 

The Hamiltonian of a system of particles (with spin 4) having a pair inter- 
action is, in the second quantization method, 


H= 2o Zm ĉr Bae, ty F (Die, Po%e | U | prey, Pos) Gy, piar ws Apyas4pya, > (6.6) 


see QM, §64. Here ât, and 4,, are creation and annihilation operators for a 
free particle with momentum p and spin component «(= +4). The first term 
in (6.6) corresponds to the kinetic energy of the particles, and the second term 
to their potential energy; in the latter, the summation is over all values of the 
momenta and spin components, subject to the conservation of momentum 
in the collisions. 

In accordance with the assumption that the particle momenta are small, 
we again replace the matrix elements in (6.6) by their values for zero momenta: 
(0x1, Ox2| U |Ox1, Oxa) = Uo/V. Next we note that, since the operators 4, ,, and 
Gy. anticommute in Fermi statistics, their product is syama with 
respect to the interchange of suffixes; the same applies to the products 4;,,, Ca 
In consequence, all terms cancel in the second sum in (6.6) that contain pairs 
of equal suffixes «1, xa (physically, this occurs because of the.fact already men- 
tioned that, in the limit of slow collisions, only particles with opposite spins 
can scatter each other). 

The Hamiltonian of the system thus becomes 


= Don Sm — diye + 2 >, tâ tâ, â, (6.7) 


Pis Pas Pi 


where 4,, = âp+ Ĝi+4 = 4,4, etc., and the suffixes + and — here and hence- 
forward replace +4 and — 4. 

The eigenvalues of this Hamiltonian are calculated by ordinary -perturbation 
theory; the second term in (6.6) is treated as a small correction to the first 
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term. The first term is diagonal, and its eigenvalues are 


EO = X, (p?/2m) npa» (6.8) 


P, & 


where n,, are the occupation numbers of the states p, «.' 

The first-order correction is given by the diagonal matrix elements of the 
interaction energy: 

Uo 
EP = V X Ny4N2~, (6.9) 
Py, P2 

where nı, = Mp,4 etc. 

To find the second-order correction, we use the known formula of pertur- 


bation theory, 
, | Vnm ? 


EP =F EE,’ 


where the suffixes n and m label the states of the unperturbed system. A simple 
calculation (with the known matrix elements of the operators â, and âf,) gives 


UZ Mpte(1—mi4) (Lm) io) 
Us Pi» Px, Pi (pi+p3—p\2—Pp,")/2m 


The structure of this expression is very clear: the squared matrix element of the 
transition pi, P2 — Pi, pz is proportional to the occupation numbers of the states 
Pi, p2, and to the numbers of unoccupied positions in the states pj, px. 

The integral Uo in (6.9) and (6.10) must be expressed in terms of a real phys- 
ical quantity, the scattering amplitude —a. In the second-order terms this can 
be done from (6.2); in the first-order terms, the more exact formula (6.5) is 
needed. After these substitutions, we find as the correction of the first order 
ina 

EO = E To nne- (6.11) 


Pi» P2 


and as the second-order correction 


pe — 2ng? x ninad mis) U —n-)-1] 
Pi» P2» Pi Pit p? =p? =p? 


for brevity, we use in the intermediate formulae the “coupling constant” of 
the gas particles? g = 4r?a/m. In expanding the expression in the numerator, 


t By assuming that the particles have definite values of the spin component, we assume that 
the statistical matrix n,g(p) is also reduced to diagonal form; the functions n,(p) with œ = +3 
are then its diagonal elements. 

+ After the renormalization of the scattering amplitude, this quantity is no longer equal to 
the constant U, in (6.2). 
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we note that the terms with products of four n cancel, because their numerators 
are symmetrical and their denominators antisymmetrical with respect to the 
interchange of px, pe and pi, p; and the summation over these variables is 
symmetrical. The final result is 


2mg? nı+n2-(ni+ +n) 
E = — Se. 6.12 
Dk FAPEP wey 





Pi» Pz» P; 


This sum (in which all the n,, + 0 as p — œ) is convergent. 

From these formulae we can calculate, first of all, the energy of the ground 
state. To do so, we must put all the ,, equal to unity within the Fermi sphere 
(p < pr = h(3n2N/V)*) and zero outside. Here it should be noted that, al- 
though in the original Hamiltonian the eigenvalues of the operator products 

+4,, give the occupation numbers of the states of the gas particles themselves, 
after diagonalizing the Hamiltonian by means of perturbation theory we are 
concerned with the quasi-particle distribution function (denoted, as in previous 
sections, by n,,). 
Since $n, = $n, = 2N, we find from (6.11) the first-order correction 


E® = gN*/4V. 
In (6.12) we replace the summation over three momenta, together with the 
condition pi+p2 = pi +Pp,, by integration over 


y3 , , 1 r 
(Quik (p1 + P2— Pi — P2) @*p1 Ape dp; dp, 


so that 





Amg’V_ f 5(pi+Pp2—Pi— Po) 
EP = l “PEMBEDA dp, Ppa dpi Pp 


the integration being taken over the range pı, po, pı = pr. The calculation of 
the integral’ gives the following final result for the energy of the ground state: 


3p% 10 pra 4(11—2log 2) /pra \? 
Eo = N 0m [i+ “A+ Bim ( h K ga 
where the coefficient of the square bracket is the energy of an ideal Fermi gas 
(K. Huang and C. N. Yang 1957). 

The chemical potential of the gas at absolute zero is given by the derivative 
u = (OE0/ON),. Expressed in terms of the limiting momentum py, it is 
DF | 4 pra 4(11—2 log 2) (E) l (6.14) 


Lesa Se Se ji 


t In practice, it is simpler to proceed in a different order, beginning with the calculation of 
the function f (see below), 
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According to the general ideas of the Landau theory, the spectrum of 
elementary excitations e(p) and the quasi-particle interaction function f,,(p, p’) 
are determined by the first and second variations of the total energy with 
respect to the quasi-particle distribution function." If E is written as a discrete 
sum over p and «, we have by definition 


SE = Y, elp) dtp tay E faw(PsP’) Spe Styes (6-15) 


Pa P, a, p’, a 


(after differentiation of the energy, n,, is to be replaced by unity within the 
Fermi sphere and zero outside). There is, however, no need to calculate in this 
way the effective mass m* of the quasi-particles, since it can be found more 
simply (see below). 

To calculate the function f.,,(p, p^) (on the Fermi surface), we twice differ- 
entiate the sum of the expressions (6.11) and (6.12), and then put p = p’ = pr. 
After making this simple calculation and changing from summation to integra- 
tion, we have 

; 4mg* 6(p+p’ —P1—P2) 

f,.-(P, p) = &- hY Neer ae 


| 
v 





Ô(Pp+pı—p'—p2)+ 6(p’ +-P1—P—P2) 
ii: 2 NE a i dpa, 
2(p?— p3) ail a 


J. +(p, pP) = f-_(p, p’) 


_ 2mg? f d(p+pi-—p’ —p2)+6(p'+pi—Pp— po) 
“ey ee 


The integration in these formulae is comparatively simple, because of the lower 
multiplicity of the integrals. 

The final result is to be put in the form (2.4), which is independent of the 
choice of the spin quantization axis. In this form it is 


2 in i 
2nah [pp eee (2+ cosy gg on 5 5) 
2sing0 1—sin}? 





ER 


Jay, g8 = 
J 


xA 





2apr toed 1+sin 4 
l + he (1 x SIN > p log ant) apan) > (6.16) 


where # is the angle between the vectors pp and pp (A. A. Abrikosov and I. M. 
Khalatnikov 1957).* 


* The matrix /,,/(p,p’) in this section is made up of the elements of the matrix Sey, BPP) 
that are diagonal in two pairs of suffixes (a, 6 and y, ô). 

* The function (6.16) becomes logarithmically infinite at ® = 2. This is because of the 
approximations made. A more exact analysis shows that, although ® = z is indeed a singular- 
ity of the function, the latter is zero there, not infinite; see the third footnote to §54. The 
invalidity of (6.16) near # = x is unimportant in subsequent applications, which invol ve in- 
tegrals convergent at this point. 
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The effective mass of the quasi-particles is found from this by integration as 
in (2.12): 





in apr 6.17 
r l+ gga (7108 2 ea ). (6.17) 


Formula (2.17) gives the velocity of sound in the gas: 


> _ P? [2 apr , 8(11-2 log 2) (apr \? 
oe tee (FE) | oo) 


Then, integrating u?m/N (expressed in terms of N/V instead of pp) with respect 
to N, we find from (2.13) the chemical potential of the gas, and a further 
integration with respect to N gives the expression (6.13) for the energy of the 
ground state. 

Formula (6.13) represents the first terms ir’ an expansion of the gas energy in 
powers of the “gaseousness parameter” n = p,a/h ~ a(N/V)"*. By similar but 
considerably more laborious calculations, we could derive some further terms 
in the expansion. The reason is that, in a Fermi gas, triple collisions contribute 
to the energy only in a fairly high approximation. Of three colliding particles, 
at least two have the same spin component; the coordinate wave function of the 
system must then be antisymmetric with respect to these two particles. Thus the 
orbital angular momentum of the relative motion of these particles is at least 
1 (p state). The corresponding wave function contains an extra power of p/h 
in comparison with the s-state wave function (see QM, §33), and therefore the 
probability of such a collision contains an extra factor p°, i.e. is reduced by a 
factor ~ (pa/h)? ~ nè in comparison with that of a “head-on” collision of 
particles not obeying the Pauli principle. In consequence, triple collisions con- 
tribute to the energy only in terms containing the volume as V-?V—?*. In other 
words, all terms in the expansion of the energy up to those of order N(pi./m)n? 
inclusive, i.e. three more beyond those shown in (6.13), are expressed in terms 
of the characteristics of pair collisions only. However, these characteristics 
will include not only the amplitude of s-wave scattering for slow collisions, as 
in (6.13), but also its derivatives with respect to the energy, and the amplitude 
of p-wave scattering. 


CHAPTER II 


GREEN’S FUNCTIONS IN A FERMI SYSTEM AT 
T=0 


§ 7. Green’s functions in a macroscopic system 


THE method used in §6 becomes laborious and in practice unusable in the higher 
orders of perturbation theory. This disadvantage is the more important in that 
the interaction between particles in actual physical problems is certainly not 
weak, and so, to ascertain the various general properties of macroscopic 
systems, we have to consider infinite sequences of terms in the perturbation- 
theory series. To overcome such difficulties, we can use a mathematical for- 
malism similar to the one in quantum field theory. 

The specific form of this treatment depends essentially on the nature of the 
macroscopic system to which it is to be applied. The subsequent sections of this 
chapter deal with the development of the formalism for a Fermi liquid at 
absolute zero. The purpose of the exposition is not only the practical applica- 
tion of the method to such a system, but also to show how the formalism itself 
is constructed. 

The starting-point is the second-quantized y operators, whose properties are 
known from quantum mechanics (see QM, §§64, 65). Here we shall need them 
in the Heisenberg representation, in which they depend explicitly on the time. 
We therefore begin by establishing some properties of the y operators in that 
representation. 

We shall consider systems of spin- particles. Accordingly, the y operators 
must be given a suffix that indicates the value of the spin component and takes 
the values +5; these suffixes will again be written as Greek letters, and summa- 
tion over repeated suffixes is implied. 

By the general rule (see QM, §13), the operator f (t) of any physical quantity 
in the Heisenberg representation is expressed in terms of the time-independent 
(Schrödinger) operator f of the same quantity by* 


f(t) = elit feih 
where #7 is the Hamiltonian of the system. 


t The systematic construction of this formalism is due to V. M. Galitskii and A. B. Migdal 
(1958). 

+ In order to simplify the formulae, we shall often use units such that the quantum constant 
h = 1 (so that the momentum and energy have dimensions of reciprocal length and recipro- 
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Here, however, it will be appropriate to modify this definition somewhat. 
The reason is that in quantum statistics it is more convenient to consider the 
states of the system not for a specified number N of particles in it but for a 
specified chemical potential u. The ground state of the system, in which it is 
found at T = 0, can then be defined as the state having the lowest eigenvalue 
of the operator 


A’ = À- uÑ, (7.1) 


and not of A as when N is specified: the probability that the system is (for a 
specified value of u) in a state with energy £, and number of particles N, is 


w ec exp {— aT HNn\ — exp (Fn) ; 
a a a ae 


see Part 1, (35.1). Here E; are the eigenvalues of the operator A’. We see that 
at T = 0 only the state with the lowest E; remains.* 
Thus we define the Heisenberg y operators by the formulae 


P(t, r) = eft (r) eA, 


W(t, r) = egt) eih, 


(7.2) 


The Heisenberg y operators will be denoted by the capital letter Ý, and the 
Schrödinger y operators by y. 

The Schrödinger y operators obey the familiar commutation rules. The 
commutators of the Heisenberg operators taken at different times t and t 
cannot be calculated in a general form, however. When t = ť', the commuta- 
tion rules are the same as for the Schrödinger operators. Thus, from the rule 


Pal JHO) Galt) = up O(r—r’) 
we have the corresponding rule 
Plt, r) P (t, + VF l, r) E, r) 
= eG) PE E) + PFE) GaN) eH 
= ôpôlr—r’). (7.3) 
Similarly, 


Pt, r) Palt, r+ Polt, r) Eal, r) = 0, | (7.4) 


tet, r) Pa Ct, r')+ V(t, r) HC, r) = 0. 





cal time respectively). To change from these to ordinary units, all momenta p and energies E 
in the formulae are to be replaced by p/f and E/f. Such units will, in particular, be used in the 
present chapter. p M 

t The term ‘Hamiltonian’ will be used for both H and 7’. 
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Differentiating the definition (7.2) with respect to time, we find that the 
Heisenberg y operator satisfies the equation 


-i EP, r) = APE, r)— P(t, r) H’; (7.5) 


cf. OM, (13.7). 

The Heisenberg and Schrödinger representations are identical for the oper- 
ator of any conserved quantity (i.e. an operator that commutes with the Hamil- 
tonian). This is true, in particular, of the Hamiltonian itself, and of the particle 
number operator, which also of course belongs to a conserved quantity. The 
expressions for these operators in terms of Schrödinger and Heisenberg y 
operators are the same. For example, the particle number operator is 


Ñ = f OF) Pelr) Bx 


= f Pin) alt, r) dx. (7.6) 
The Hamiltonian of a system of interacting particles is 
H = HOLPoOLPOs _.,, | 
Pos [ Pilt, r) A(t, r) dx — uN, | 
2m | (7.7) 


VO = | Pil, r) UVM E, r) ex, 
po 


“| Palt, r) V(t, r) UPr—r’) Y(t, r) Volt, vr) Bx Bx’, 


II 


where #7’ is the Hamiltonian of a system of free particles; VY is the operator 
of their interaction with the external field U(r); V the operator of their 
pair interaction, U“(r—r’) being the interaction energy of two particles. The 
omitted terms represent triple etc. interactions; cf. QM, (64.25). For simplicity, 
all interactions are assumed to be independent of the spins of the particles. 

The commutator of A’ and Ý, in (7.5) is calculated by means of the rules 
(7.3) and (7.4); the delta functions that appear are removed by integration. 
We thus obtain a “Schrödinger equation” for Ý (t, r), in the form 


ð 
i Pn r) = (-3g4- pt vew) P(t, r) 
+ f P(t, r) UL- r) Pelt, r) Bx’ Palt, r). (7.8) 


The concept of the Green’s function for a macroscopic system is fundamental 
in the method described here. This function is defined by* 

Gup(X1, X2) = —KTE(X1) PY). (7.9) 

Here and and below, X denotes for brevity the time t together with the position 


vector r. The angle brackets (...) denote averaging with respect to the ground 


* This definition is analogous to that of the exact Green’s functions (propagators) in quan- 
tum electrodynamics (cf. ROT, §§100, 102). 
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state of the system, instead of the more cumbersome notation (0! ... |0) for 
the diagonal matrix element. The symbol T denotes the chronological product: 
the operators following it are to be arranged from right to left in order of 
increasing times t, t2. For fermions, the interchange of a pair of y operators 
(as compared with their arrangement in the original writing of the product) 
must change the sign of the product. Explicitly, 


Gap(X, X2) = | — KPAX) Pi (X) for tr > te, | 


KPF (Xo) PAX) for h< te ve 


There are some obvious properties of the Green’s function. If the system 
is not ferromagnetic and not in an external field, the spin dependence of the 
Green’s function reduces to a unit matrix: 


Gup(X1, Xo) = bupG(X4, Xə); (7.1 1) 
any other dependence would distinguish a particular direction in space, the 
z-axis of spin quantization.’ Since time is homogeneous, tı and tz appear in 
the Green’s function only as the difference t = t1— fe. If also the system is 
microscopically homogeneous in space, the coordinates of the two points 
appear only as the difference r = r1—rg. In other words, for this case we have 


Gig Xi, Xo) = dupG(X), X = X1— Xo. (7.12) 


It must be emphasized that microscopic homogeneity means that the body is 
assumed homogeneous not only as regards its mean (macroscopic) density but 
also as regards the probability density of various (microscopic) positions of its 
particles in space. Liquids and gases have this property (but solid crystals do 
not). Their isotropy has the result that G(t, r) = G(t, —r). In this connection, 
let us note once again that the function G(t, r), by its definition, is certainly not 
an even function of t. The order of t, and tə in the difference t = tı— tə is for 
that reason significant. . 

The coordinate density matrix of a particle in the system is defined as the 
mean value 


daplt Ba) = EPC, ra) Pals, 1). (7.13) 


From a knowledge of this matrix we can find the mean value of any quantity 
pertaining to an individual particle. Let Fy be some “one-particle” operator, 
i.e, an operator of the form 


Pp = SFY. (7.14) 


where I is an operator acting on the coordinates and spin of only one (the 
ath) particle, and the summation is over all particles in the system. In the 


t This statement needs elucidation. The spin components Ê, form a contravariant spinor of 
rank one (and in this sense it would be more correct to raise the index: ¥). The components 


Ys form a covariant spinor. Thus Geg is a mixed spinor of rank two, and d,g is a unit spinor of 
this kind. 
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second-quantization formalism, such an operator is written (in the Heisenberg 
representation) as 


Fit) = f PEC, fy r) Bx; (7.15) 


cf. OM, (64.23). Hence it is clear that the mean value of F can be expressed in 
terms of the density matrix as 


(F) = Nf) = f LAP oae(r1, ro)l =r: dx, (7.16) 


where fG is an operator acting on the coordinates r; (we put r, = rı after 
applying this operator but before integrating). 

According to (7.10), the density matrix can be expressed in terms of the 
Green’s function: 


i 


OaplE1, F2) = — N Gap(ti, r1; t1 +0, re). (7.17) 


Here, and everywhere henceforward, writing the argument of the function as 
tı +0 signifies taking the limit as it tends to tı from above. This ensures the 
correct arrangement of the y operators, as in the product (7.13). 

For a microscopically homogeneous system, the density matrix depends only 
on the difference r = rı—r2, and if there is no spin dependence, 0,, = 6,0, 
with 


e(r) = Lat =—0,r); (7.18) 


here G,,(X;, X2) has been replaced by G(X, — X,) = G(X) in accordance with 
(7.12). With rı = re, after taking the trace with respect to the spin variables, 
the operator product in (7.13) becomes ¥+¥,, the operator of the particle 
number density in the system. The mean density of the body is therefore 


N/V = 2Noe(0) = —2iG(t = —0,r = 0), (7.19) 


where ft tends to zero from below. This equation relates the chemical potential 
u at T = 0 (on which G depends as a parameter) to the particle number density 
NIV. 

The Fourier expansion of the function (11, r2) determines the momentum 
distribution of the particles: 


N(p) = N f oi ra) e~ 1-9) P(x, — xa) 
= —i | IGC, nhe-oeP* ax. (7.20) 


t The one-particle density matrix is (see QM,§14) the integral 
oli, ra) = f Pee, a) Pes, gda, 
where Yr, q) is the wave function of the system as a whole, r denoting the position vector of 


one particle and q the set of coordinates of all the other particles, with integration over these. 
The Fourier components of the density matrix are equal to 


f| f Po, pert d3x |? dg, 
and this gives its relation to the particle momentum distribution. 
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This is the number of particles (per unit volume) with a specified value of the 
spin component and with momenta in the range d3p/(2)*. Here we are referring 
to actual particles, not to quasi-particles (which have not yet made an appear- 
ance in the formalism being described). The notation N(p) is used in contrast 
to the quasi-particle distribution function 7(p). 

We shall usually be concerned with the Green’s function in the momentum 
representation, defined as the component of the Fourier expansion of G(t, r) 
with respect to t and r: 


G(t,r) = | G (w, petz- dw d3p/(2n)*, (7.21) 
Glow, p) = fG (t, r) e7@2— dt dx, (7.22) 
The particle momentum distribution is expressed in terms of this function by 
Pe r _,., dw 
N(p) = —i lim f G(w, pje —, (7.23) 
t——0 2x 
which is found by substituting (7.21) in (7.20). It is normalized by 
i dw dp N 
= lot i E 
2i im f G(w, p) e7% nyi y>’ (7.24) 


which is the condition (7.19) in the momentum representation. Thus the distri- 
bution N(p) automatically has the correct normalization 


2 | N) dp/(2n = NIV. 


The limit in which the integrals (7.23) and (7.24) are taken is equivalent to a 
particular contour rule in the plane of the complex variable w. The presence of 
the factor e~’” with t < 0 allows the path of integration (the real axis) to be 
closed by an infinite semicircle in the upper half-plane of w, so that the integral 
is determined by the residues of G(, p) at its poles in that half-plane. 


§8. Determination of the energy spectrum from the Green’s function 


For a microscopically homogeneous system, it is easy to determine the time 
and coordinate dependence of the matrix elements of the Heisenberg y operator 
with respect to stationary states having definite values of the energy and mo- 
mentum. 

The time dependence is given by the usual exponential factor: 


(n| Palt, r) |m) = e'nn | $ur) | MY, (8.1) 


but, since the Heisenberg y operator is defined by means of the Hamiltonian A’, 


we have 


= En—- m— uU(Nn— Nm). 
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According to the general properties of y operators, Ù decreases (and Wt 
increases) by one the number of particles in the system. Hence N, = N,,—1 
in the matrix element (8.1), so that 


Onm = n(N) ~E,,(N+ 1) a H» (8.2) 


where the arguments are the numbers of particles in the corresponding states. 

To determine the dependence on the coordinates, we note that, since the 
system is homogeneous, a displacement relative to the system through an 
arbitrary distance r cannot alter the matrix elements of its y operators. This does 
not mean, however, that the matrix elements are independent of the coordi- 
nates. The reason is that the difference between y,,,(r) and the value »,,,(0) 
at some specified point r = 0 is due to two causes: the displacement through r 
relative to the system itself, and the movement of the point of observation to a 
different position, which also changes the phases of the wave functions. In 
order to exclude this latter change, we shift the system through —r, i.e. apply 
to its wave functions the parallel-translation operator 


T(—r) = e-r, 
where P is the operator of the total momentum of the system; see QM, (15.13). 
These operations return the point of observation to its original position, but 


it remains shifted by r relative to the system. The invariance of the matrix 
elements under this transformation is expressed by 


(n| @x(0){ my = lnl et 2G, (r) eB! m). (8.3) 
If the system has definite momenta P, and P,, in the states n and m, then 


(1! PO) | m) = elkamt (n! a(r)! M), 
whence 
(n i Y(t, r)! my = Ontan | (0)! m), 8.4) 
(n| WF(t, r)! m) = (m' L(t, r)'n)*, G 
where Kpn = P,—P.,- 

Using these formulae, we can deduce an important expansion of the Green’s 
function in momentum space, which clarifies its physical significance. 

Because of the “discontinuous” definition of the function G(t, r), in calculat- 
ing G(w, p) we must separate the integral over t in (7.22) into two, from — œ 
to 0 and from 0 to oo. In the second (i.e. when t = tı— t > 0), we expand the 
definition (7.10) by the matrix multiplication rule and find 


G(t,0) = 3 iGa = — Fi Y (0| P(X)! m) (m| PECX) | 0), 


with summation over all quantum states of the system. Substituting (8.4) and 
noting that Po = 0 in the ground state, we have 


G(r) = =H F [COP O)|m)|? Comte, (8.5) 
where Wom = Eo(N)—E,,(N+1)+u. 


§ 8 Energy Spectrum from the Green’s Function 35 


o 


The spatial integration in (7.22), with G(t, r) from (8.5), gives the delta func- 
tion 6(p—P,,,) in each term of the sum. In the integration over t (> 0), to ensure 
convergence, we must add to w an infinitesimal positive imaginary part, i.e. 
replace w by w+i0.' Then 


1 i o(p—P,,,) 
i(@t—p-r) 3 rs 3 2 —_——__, 
{| G(t, r) e d*x dt 5 (27) py |<O|%.(0)|>| SESTRE, 
Ô 


The integral over t from — ~ to 0 is calculated similarly. For t < 0 we have 
instead of (8.5) 


G(t, r) = 4i F. | (m | Pa(0)! 0) |? el@mot—Pm-t), (8.6) 


where wpo = E,,(N—1)—Eo(N)+ u. Now, calculating the integral from — <o 
to 0 and adding it to the other, we obtain 


AmO(p—Pm) 


1 
o i, nh! A 7 
Go, p) = 5 2x} > | @+tEAN)—E,(N+1)+10 * 
@+ wt EAN—1)—Eo(N)—10 (° 
with the notation 
Am = |(0| P.(0)! m), Bm = |(m | Pe(0) | 0) |?. (8.8) 
This is the required expansion.* 
We shall use the notation 
eP = En(N+1)—E(N), P = Eo(N)—Em(N—1) (8.9) 


for the excitation energies given by the differences between the excited level of 
the system with a particular number of particles and the ground state of the 
system with one particle more or fewer. The superscripts (+) and (—) indicate 


the inequalities 
Gap, <u. (8.10) 


For, since Eo(N+1)—Eo(N) ~ OFo/ON = u, the chemical potential at T = 0, 
we can write, for instance, 


et) = Em(N-+1)~Eo(N+ 1)+Eo(N+ 1)—Eo(N) 
~ [Em(N+1)—Eo(N+ 1)]+ u. 


t This procedure is analogous to the method of calculating Green’s functions in quantum 
electrodynamics (cf. ROT, §76). 

+ The corresponding expansion in quantum field theory is the Kallén—Lehmann expansion 
(cf. ROT, §§101 and 108). 
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The difference in the square brackets (where both energies relate to systems 
with the same number of particles) is positive by the definition of the ground 
state; hence {H > u. The significance of the definition (8.9) will be discussed 
again below. 

The displacement of the poles of the terms in the sum (as functions of w), 
expressed by the terms +0 in their denominators, is equivalent to the presence 
of delta-function imaginary parts according to! 


1 nee 
ano pa F ind(x). (8.11) 
Applying this to (8.7), we find as the real part of the Green’s function 


re G(w, p) = 42°)" P 


m 


eiea eT] , (8.12) 


OF EGP) ot us 


and its imaginary part (since each difference e{+)— u > 0, and each difference 
ef — u < 0) 


— 424 Y AmO(P—Pm) (o+u—eP) for œ > 0, 


im G(o, p) = 4ni 5 BmO(p+Pm) (o+yu—eS) for wo <0. GT) 
Hence we always have 
sgn im G(o, p) = —sgn w. (8.14) 


We may also notice the asymptotic behaviour of the function G(w, p) as 
w — co, From (8.7), 


Ans 
G(a, p) = a S [Amb(P— Pm) + Bm + Pm). 


The coefficient of 1/w is easily seen to be the Fourier component with respect 
to rı—Tr> of 


+ (L(t, rı) W(t, rə) + Plt, ro) Pt, ri)} = é(r, —T9), 


i.e. unity. Thus 
G(w,p) + 1/w as |w] +o. (8.15) 


t See QM (43.10). The symbol P denotes that in the integration of expressions of the form 
I (x)/(x+ i0) the integral is to be taken as a principal value: 


ao 





x10 


— OO 


I(x) dx =P Í LO) ax inf). 


The second term comes from the passage round the pole x = — 70 or x = i0 along a semicircle 
above or below it respectively. 
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The chief property of the Green’s function in the momentum representation 
is that its poles can only be at the points w = ¢,,— H, where ¢,, are the discrete 
excitation energies of the system, defined as shown above. Each of these energies 
corresponds to a definite value of the momentum P, of the system, as is evident 
from the presence of a corresponding delta function in each pole term of the 
Green’s function. 

We are interested here, however, in the Green’s function of a macroscopic 
body. This means that we are considering the limit in which the volume V and. 
the number of particles N tend to infinity (for a fixed finite value of the ratio 
N/V). In this limit, the separations between the levels in the system tend to 
zero, the poles of the function G(w, p) merge, and we can say only that this 
function has an imaginary part for values of w+ u in the continuous range of 
possible values of the excitation energy of the system. Excitations in which the 
whole momentum p of the macroscopic system can be ascribed to one quasi- 
particle with a definite dispersion relation e(p) (in the ground state of the 
system, p = 0) form an exception; such values correspond to isolated poles 
of the Green’s function. 

If the momentum p is made up of the momenta of more than one quasi- 
particle, the energy of the system is not uniquely determined by the value of p: 
a given momentum of the system can be composed in various ways of quasi- 
particle momenta, with the total energy of the quasi-particles covering a con- 
tinuous range of values; the pole is removed by integration over all such states. 

Thus the quasi-particle dispersion relation is determined by the equation 


G-1(e— u, p) = 0 (8.16) 


(V. L. Bonch-Bruevich 1955). 

It should be emphasized that the definition of the excitation energy as in (8.9) 
in fact corresponds to the definition of the quasi-particle energy in the Landau 
theory: the difference e‘t) is the change in the energy of the system when one 
particle is added to it, and if the whole of this change is ascribed to one quasi- 
particle we have e defined in accordance with (1.3). Similarly, — eS is the change 
in energy when one particle is removed, and so íp is the energy of the quasi- 
particle removed. It is therefore natural that e < u, since in the Landau 
theory a quasi-particle can be removed only from within the Fermi sphere.* 

Since all the excited states that occur in the expansion (8.7) are obtained 
from the ground state by adding or removing one particle with spin 4, it is 
clear that, for a system of fermions, the poles of the Green’s function determine 
only the spectrum of Fermi-type elementary excitations. It will be shown in §18 
how the Bose branch is determined. 


t It should be noted that the excited level En of the system appears with the negative sign 
in the definition of the quasi-particle energy e‘—), This is the reason why the momentum of 
these quasi-particles p = — Pm, as is seen from the delta function 6(p+ P,,) in the corresponding. 
terms of the expansion (8.7). 
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The description of the spectrum of a macroscopic system by means of the 
concept of quasi-particles with a definite dependence of e on p is an approxi- 
mate one, whose accuracy diminishes with increasing 'e— ui. The departure 
from the picture of independent quasi-particles is shown by the shift of the 
Green’s function pole into the complex domain, e(p) becoming complex. 
According to the general principles of quantum mechanics (see QM, §134), 
complex energy levels signify a finite lifetime z of the excited state of the sys- 
tem: t ~ 1/|im e'. The quantity im e itself represents the degree of “broaden- 
ing” of the quasi-particle energy values (the level width). Of course, this treat- 
ment is meaningful only if the imaginary part is sufficiently small, 'im €| << 
<«|e— ul. As explained in §1, this condition is in fact satisfied for weakly 
excited states of the system, since 'im e| ~ }/t oc (p—prf)®, whereas re(e— u) oc 
oc | p—Prl- 

The required sign of im e is ensured by the fixed sign of the imaginary part 
of the Green’s function: near its pole, this function has the form 


G(o, p) ~ Z/[o+ u—e(p)], (8.17) 


and the constant Z > 0, as follows from the fact that the coefficients A,, and 
B,, in the expansion (8.7) are positive; Z is often called the renormalization 
constant (by analogy with quantum electrodynamics). The imaginary part 
of the Green’s function is 


imG ~ Zim e/lø+u— e". 


Since this expression relates to values of w ~ e—y, we find, on comparing its 
sign with the rule (8.14), that 


ime<0O when ree = (8.18) 


ime >0 when ree< u, 


as it should be: this sign of im corresponds in both cases (e¢*) and ef in (8.9)) 
to the correct negative imaginary increment to the energy E,, of the excited 
State. 

The analytical properties of the Green’s function will be further discussed 
in §36, where this question will be considered for the general case of arbitrary 
temperatures. 


§ 9. Green’s function of an ideal Fermi gas 


To illustrate the general relations given in §8, let us calculate the Green’s 
function of an ideal gas. 
The Schrödinger y operators can always be written as an expansion 


P(r) T 5 ÂpoYpalt , o) (9.1) 


$9 Green’s Function of an Ideal Fermi Gas 39 


in terms of a complete set of functions y,,, the spinor wave functions of a free 
particle with momentum p (and spin projection o), i.e. in plane waves 


o 
oa ES ele (9.2) 
where u, is the spinor amplitude normalized by the condition u,u* = 1; this 
choice of the functions ,, has no connection with the actual interaction of the 
particles in the system. 

For a system of non-interacting particles, the Heisenberg y operator can 
also be written in an explicit form. In this case, the change from the Schrödinger 
to the Heisenberg representation consists in placing in each term of the sum in 
(9.1) the corresponding time factor: 

2 

P(t, r) = Xen, 0)exp [-; (ora z n) r| (9.3) 
This is easily seen if we note that the matrix elements of the Heisenberg operator 
for every transition i + f must contain factors exp [—i(E;—E;)t], where E; 
and E; are the energies of the initial and final states (in this case, eigenvalues 
of the Hamiltonian A’ = A—uN). For a transition with decrease in the number 
of particles in the state p, « by one, the difference E; —E; = p?/2m— u, so that 
the condition stated is satisfied. 

However, instead of directly calculating the Green’s function by means of 
(9.3) from the definition (7.10), it is more convenient to begin by converting 
this definition into an equivalent differential equation. To do so, we differentiate 
G,(X1—X2) with respect to tı. It is necessary to take account of the disconti- 
nuity of this function at t; = ta: according to the definition (7.10), the amount 
of the discontinuity is 


[Gep] = [Gap] =t +0— [Gap] =r —0 
= —i(P (ty, r1) Pi (t1, r) + EF (t, ra) Balti rD) 


or, from (7.3),' 
[Gap] = i6.g0(r1—F2). (9.4) 


The presence of the discontinuity gives rise to a term [G,,]5(t1— t2) on differ- 
entiation. Hence 


ice Gap = —i / eL PE Y- iôapò(f1—r2) (tı — ta). (9.5) 


Ory X 
For a system of free particles, the Heisenberg y operator satisfies the equation 
OL, 1 
1 Bp = gm OE Pa; 


t It must be emphasized that the magnitude of the discontinuity does not depend on the 
interaction of the particles. 
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cf. (7.8). Substituting this derivative in (9.5) and again using the definition 
(7.10), we get as the equation for the Green’s function 


(i a ES = af n) G(r, r) = (A) d(r), (9.6) 


where we have put G P = 6,,G; the superscript (0) to G indicates that there 
is no interaction between the particles. 
This equation has the Fourier transform 


(0-74 n) Go, p) = 1. 

2m f 

In determining the Green’s function from this, we must add to w an infini- 

tesimal imaginary part in such a way that the imaginary part of G has the cor- 
rect sign in accordance with (8.14): . 

2 -1 

Go, p) = |o-Fat +10. sgn J ; (9.7) 

The pole of this expression is at w+ u = e(p) = p?/2m, in accordance with 

the fact that in an ideal gas the quasi-particles are the same as the actual 

particles. The chemical potential of an ideal Fermi gas is u = p}/2m. For 

weakly excited states, p is close to pp, so that we can put p?/2m ~ u+ vp(P— Pr), 

where vp = p/m, and write the Green’s function for such states in the form 


Gw, p) = [w— ve(p—pr)+i0. sgn wt. (9.8) 


In all integrations involving the function G™, the presence of the infinitesimal 
imaginary part in its denominator is important only near the pole, when 
w ~ Up(p—p,). In this sense, sgn w in (9.7) may be replaced by sgn (p— pp), 
and G® written as 


Gw, p) = [w?— p?/2m + u + i0. sgn (p—pr)]-}. (9.9) 


This change is important in that G in the form (9.9) is a single function of the 
complex variable w, analytic throughout the plane, and the methods of the 
theory of analytic functions can be used to calculate the integrals. 

For instance, to calculate the integral (7.23) (the particle momentum distri- 
bution) for a non-zero negative t, we close the contour of integration (the real 
@-axis) by an infinite semicircle in the upper half-plane (and can then put 

= 0). The integral 


NO =- | s=ppmtasios GP 


is now determined by the residue of the integrand at the pole in the upper half- 
plane. When p > pp there is no such pole, and N(p) = 0. If p < pp, however, 
we find N(p) = 1, as it should be for the ground state of an ideal Fermi gas. 
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§ 10. Particle momentum distribution in a Fermi liquid 


The Green’s function of a Fermi liquid cannot, of course, be calculated in a 
general form as was done for a Fermi gas. But the statement that a Fermi 
liquid has a spectrum of the type described in §1 implies that its Green’s func- 
tion has a pole at 


w= e(p)— u ~ Uf(p—pr), UF = pr/m*. (10.1) 
It can therefore be written as 
Z 
G(o, p) = 


DAP pp Tena p), (10.2) 


where g(w, p) is a function finite at the point (10.1). As already noted in con- 
nection with (8.17), the coefficient Z (the residue of G at the pole) is positive. 

An interesting conclusion can be drawn from (10.2) about the nature of the 
particle (not quasi-particle) momentum distribution in the liquid. We calculate 
the difference between the values of the distribution function N(p) (which in 
practice depends only on the magnitude p) on the two sides of the surface of the 
Fermi sphere, i.e. the limit of the difference N(pp—q)—N(pr+q) as q — +0. 

The distribution N(p) is expressed in terms of the Green’s function by the 
integral (7.23). Since g(a, p) is finite, it is evident that the difference between 
the integrals of g tends to zero with q. It is therefore sufficient to consider the 
difference between the integrals of the pole terms in (10.2). Since, in this integ- 
ration, the term i0 in the denominator is important only near the pole, we can, 
as already mentioned in §9, replace sgn w by sgn( p—p,). Then 


; J Z Z dw 
N(pr—4)—-N(pr+9) =—i f e E esc FD 
since this integral of the difference converges, the factor e~'” in it, with t = — 0; 


may be omitted. Now, closing the contour of integration by an infinite semicircle 
in either half-plane, we find that the whole integral is equal to Z, and inde- 
pendent of g. Thus 


N(pr—9)—N(pr+9) = Z (10.3) 


(A. B. Migdal 1957). 
It has been mentioned above that Z > 0. Since N(p) = 1, it follows from 
(10.3) that 
0<Z<!l; (10.4) 


the value Z = 1 is reached only in the limit of an ideal gas. 
The particle momentum distribution in a Fermi liquid at T = 0 therefore 
has, as in a gas, a discontinuity on the surface of the Fermi sphere, decreasing 
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towards the outside. Unlike the gas case, however, the magnitude of the dis- 
continuity is less than unity, and N(p) remains non-zero for p > pr, aS shown 
in Fig. 1 by the continuous curve; the broken curve corresponds to a gas. 
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§ 11. Calculation of thermodynamic quantities from the Green’s function 


A knowledge of the Green’s function of a system is sufficient to describe its 
thermodynamic properties. When T = 0, these properties are expressed by the 
dependence of the energy of the system (which is the ground-state energy Eo) 
on the density N/V. 

When the quasi-particle dispersion relation e(p) has been determined (by 
solving equation (8.16)), this dependence can be found by using the fact that 


e(pr) = u. (11.1) 
Since the dependence of pp on N/V is known, from (1.1): 
pr = (37°)8(N/V)48, (11.2) 


equation (11.1) determines the function u(N/V) (though in an implicit form, 
since the dispersion relation (p) in general contains u as a parameter). At 
T = 0 (and therefore S = 0), the chemical potential u = (0Eo/0N)p; integra- 
tion of this gives the required energy 


Eo = { u(NIV)AN; (11.3) 
0 


when N = 0, Eo = 0, of course. 

Another way of describing the thermodynamic properties at T= 0 is to cal- 
culate the thermodynamic potential 2. According to the general definition 
(see Part 1, §24), this potential Q = E—-TS— uN = —PV, and its differential 
dQ = —SdT—Ndu; when T = 0, S = 0 also and these expressions reduce to 


Q2 = E—uN, (11.4) 
dQ = —Ndu. (11.5) 


The significance of the potential Q is that it describes the properties of the sys- 
tem at constant V. 
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The simplest method of expressing Q in terms of the Green’s function is to 
use the relation (7.24) between N/V and G. Substituting N from (7.24) in (11.5) 
and integrating with respect to u (with V constant), we obtain 


u 
, , -ior PP FO 
Q(u) = 2iV f du. lim, { Oe p)e~ ia One’ (11.6) 


0 





since again 2 = 0 when u = 0. 


§ 12. ¥ operators in the interaction representation 


The Green’s function for a system of interacting particles cannot, of course, 
be calculated in a general form. There is, however, a mathematical technique 
(similar to the diagram technique in quantum field theory) whereby it can be 
calculated as a power series in the particle interaction energy, each term being 
expressed by means of the Green’s functions of a system of free particles and 
the interaction operator. 

We shall use, as well as the Heisenberg representation, a representation of 
operators in which their time dependence is given not by the actual Hamilto- 
nian of the system 


H' = HOP = AO —uN+V 


(where V is the interaction operator) but by the free-particle Hamiltonian 
(0): 
W(t, r) = exp iA’) P(r) exp (iR VÀ. (12.1) 


The operators and wave functions in this interaction representation will be 
distinguished by the suffix 0. By expressing the Green’s function in terms of the 
operators Wo (instead of the Heisenberg operators W) we take the first step 
towards the objective of expressing G in terms of G% and V. 

In this section, ® or @ will denote wave functions in “occupation number 
space” (in contrast to the coordinate wave functions ¥ or y); these functions 
are acted on by second-quantized operators. Let ọ be such a function in the 
Schrédinger representation; its time dependence is given by the wave equation 


i0¢/Ot = (A'O Pje. (12.2) 


In the Heisenberg representation, where the whole of the time dependence is 
transferred to the operators, the wave function ® of the system is a constant, 
independent of time. In the interaction representation, however, the wave 
function ®o is time-dependent, but only because of the interaction of the par- 
ticles in the system, and is given by 


10Do(1)/Ot = Vo(t) Bolt), (12.3) 
where 


Vo(t) = exp (R'A Y exp (—i’2) (12.4) 
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is the interaction operator in that representation; for operators having the 
form (7.6), (7.7), the change to the new representation is obtained by simply 
substituting Yo for Ý. Equation (12.3)is easily derived, since the transformation 
of operators by (12.1) corresponds to transformation of the wave functions 


according to 
Do = exp (if’ dd; (12.5) 


see OM, §12. Differentiating this and using (12.2), we get (12.3).t 
From (12.3), the values of Do(t) at two successive instants are related by 


Do(t+ ôt) = [1—i6t.V (HPH 
= exp {—i6t.Vo(t)} Do(t). 


Accordingly, the value of Øo at any instant ¢ can be expressed in terms of its 
value at some initial instant to (< t) by 


D(A = S(t, to) Bolto), (12.6) 
where 
t 
S(t, to) = T] exp {—iôt.Po(ti)}; (12.7) 
ti=to 
the factors in this product are clearly arranged from right to left in order of 
increasing time t;; it is understood that we take the limit of the product over 
all the infinitesimal intervals ĝt between to and t. If Vo(t) were an ordinary 
function, this limit would reduce simply to 


exp f- f Vo(t) ar) ; 


but this result depends on the commutativity of the factors pertaining to 
different instants, which is assumed in changing from the product in (12.7) 
to the summation in the exponent. For the operator V(t) there is no such 
commutativity, and the reduction to an ordinary integral is not possible. In- 
stead, we can write (12.7) in the symbolic form 


S(t, t) = T apf- f Vo(t) ar) ; (12.8) 


where T denotes the chronological ordering of the factors in the same sequence 
as in (12.7), i. e. with the time increasing from right to left. 
The operator $ is unitary ($-t = $+), and has the obvious properties 


S(ta, te) §(to, tr) = S(ts, t1), 12.9 
S-to, t1) S-U ts, t2) = S~ (ts, t1). j i 


+ Equation (12.3) is the same as ROT (73.5), and the following method of solution repeats 
that given in ROT §73. 
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To simplify the subsequent analysis, we make the formal hypothesis (which 
does not affect the final results) that the interaction Vo(t) is adiabatically 
“switched on” between t = — æ and a finite time, and adiabatically “switched 
off” at t = + œ. Then, as t + — œ, before the interaction begins, the wave 
function ®o(t) coincides with the Heisenberg function ®. Putting t = — œ 
in (12.6), we get 

D(H) = S(t, — 0)®. (12.10) 


Having thus established the relation between the wave functions in the two 
representations, we also have the transformation rule for operators, including 
w operators: 


Y= S-t, — œ) Post, — œ). (12.11) 


Since § is unitary, the operators Ý+ are transformed in the same way. 
Let us now express the Green’s function in terms of y operators in the in- 
teraction representation.’ Let t; > te; then 
Gup(X1, Xo) = — iE) PF (t2)) 
= —KS§-(t,, — œ) Polti) S(t, — ©) x 
x S-ta, — co) Wes(t2) S(te, — <2). 
According to (12.9), 
Sti, — ©) Ste, — ©) = S(t, t2) §(t2, — co) S“(tg, — 00) 
= S(t, to), 
S(t), — œ) = $-Xt, — ©) S-A, t1) Sœ, ty) 
= S-o, — œ) S(c0, tı). 


Substitution in the preceding expression gives 
GuaX, Xo) ear i $-3(co s oo) S( < 9 ty) Polti) S(t, te) P hlt) S(te, =~ oS )). 


Taking the operators § as the products (12.7) we see that all factors from the 
second onwards in the averaged expression are in chronological order from 


right to left, £ = — œ to t = œ. We can therefore write 
Gal Xi, X2) = —i($-1 T Polt) Pilta) ST), (12.12) 
with 
S = (œ, — œ) = Tepl- f Pa) arl. (12.13) 


The calculation with t; < tz differs from the above only in the notation, 
and the final result (12.12), (12.13) is valid for any tı and tz. 

The transformation made does not depend on the state of the system with 
respect to which the averaging is done. However, if the averaging is with 


t This derivation repeats the one given in ROT, §100. 
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respect to the ground state (as in (12.12)), the transformation can be carried 
further. To do so, we note that the adiabatic switching on or off of the inter- 
action, like any adiabatic perturbation, cannot cause a transition with change 
of energy of the quantum system (see QM, §41). Hence a system in a non- 
degenerate state (such as the ground state) will remain in that state. That is, 
the effect of the operator $ on the wave function ® = ®o(— œ) must reduce 
to multiplication by a phase factor (which does not affect the state), the mean 
value of § in the ground state: SØ = ($\®. Similarly, B"S-! = (§)- 10", 
Thus we have finally the following formula for the Green’s function in terms 
of operators in the interaction representation :* 


iGap( Xa, Xa) = tla PiX) SI). (12.14) 


According to the meaning of this representation, the averaging in (12.14)is with 
respect to the ground state of a system of free particles: the properties of the 
operators Ware the same as those of the Heisenberg operators ¥ in the absence 
of interactions, and the Heisenberg wave function ® is independent of time, 
so that it is the same as its value at t = — oo, when there is no interaction. 
Hence, in particular, 


(TPo.(X1) PHX) = iGQ(X1, Xe) (12.15) 


is the Green’s function of a system of non-interacting particles. 


§ 13. The diagram technique for Fermi systems 


The significance of symbolic expressions such as (12.14) is that they make it 
easily possible to write down the successive terms in expansions in powers of 
V. For example, 


(T Pol X) Pal) $) = 





n! 


n=0 


S E [dinos È UKT DOPE Palt -Path 13.1 


— 00 —co 


and the expression for ($) differs from the above only in that the factors Dont ba 
do not appear in the T product. As already mentioned, the operator Vo(t) in 
the interaction representation is found from (7.7) by replacing all the ¥ by o. 
The calculation of successive terms in the expansion (13.1) thus reduces to the 
averaging, with respect to the ground state, of the T product of various num- 
bers of y operators of free particles. 


t The notation in (12.14) is in a certain way conventional: although it contains the symbol 
T twice (once explicitly and once in the definition of S), all factors in the product must really 
be arranged in a single chronological sequence. 
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These calculations are made largely automatic by the rules of the diagram 
technique, which, however, essentially depend on the nature of the physical 
system considered. The technique described in this section relates to non- 
superfluid Fermi systems, the particles being assumed to have a spin-independ- 
ent pair interaction. The corresponding interaction operator is 


Volt) = 4 f P(t r1) PGC, r2) U(r1—re) Posl, r2) Polt, r1) dxi dxo, (13.2) 


where U(rı—r2) is the interaction energy of two particles; the superscripts 
(2) to V and U are omitted. 

The mean value of products of y operators is calculated by Wick’s theo- 
rem:* the average of the product of any (even) number of operators ¥ and ¥* 
is equal to the sum of products of all possible means (contractions) of pairs 
of these operators. In each pair, the operators are in the same order as the 
original product. The sign of each term in the sum is given by the factor 
(—1)’, where P is the number of interchanges of operators needed to bring all 
the averaged operators together. 

Only those contractions are non-zero which contain one operator ¥ and one 
+ :in the diagonal matrix element, all particles annihilated by the operator ¥ 
must be created again by ¥*. It is therefore clear that the mean value of the 
product of several y operators can be non-zero only if it contains the same num- 
ber of operators ¥ and ¥t. 

When applied to the average of the T product, Wick’s theorem enables it 
to be expressed in terms of the means of paired T products, i.e., according to 
(12.15), in terms of the Green’s functions of free particles. We shall do this for 
the first-order correction to the Green’s function of a system of interacting 
particles. 

First of all, let us note that, in expanding the expression in the numerator 
of (12.14) by Wick’s theorem, we get, in particular, terms of the form 


(Toa X1) lX) (S) = iGQ(X1, X2) (S), (13.3) 


in which the pair of y operators that are “outside” $ are contracted; the ex- 
pression for (S$) contains, in each term of its expansion, only contractions of 
“inside” operators. The factor ($) cancels entirely with the denominator in 
(12.14), and so all these terms give just the “unperturbed” Green’s function 
iGQ. 

- Retaining the first two terms of the expansion in (13.1), substituting (13.2) 
and renaming the variables, we find 

iGup(X1, X2) = iGY+iGY, 
where 


IG = — 4 iT Pol X1) Phl X) X 
x f dt f dxs Px PEt, r3) Y(t, ra) U(r —r4) Polt, ra) Y(t, I's)). 


t The proof is given at the end of the section so as not to impede the discussion here. 
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For greater compactness in the formulae, we use the notation 


U(X1 — Xe) = U(rı—r2) (tı — t2). (13.4) 
Then’ 
IGP = — hi | TR PYG PY) Uu Xs Xs, 


where d!X = dt dx. 
In order to average by Wick’s theorem, we write out the operators separately 
and show all the relevant contractions: 


+ Us + + US + ok 
, eee | Ta: 
+ PPPS EV Pat DD, 


The terms containing contractions Pw have been omitted in accordance 
with the previous discussion. The operators contracted in pairs (joined by the 
loops) are to be interchanged so as to be adjacent. For instance, the first term 
written above denotes the product 


(THES) (THEY) TP Pa), 


and the last one 1s 


— (TH Py) (TEE) Pes). 


The contractions of products of y operators with different arguments are 
replaced according to 


P+ = (THF) = iG, VEY, = —iG%,, etc. 


Those of y operators with the same argument represent the spatial number 
density of particles in an ideal gas (denoted by n‘), regarded as a function of 
the chemical potential :* 


CEP) = n(n) = (2my)3!?/3n2. (13.5) 
Thus we have 
iGD = 4 f dX, BX Uy l—-GOGLGY—GLGLGY 
+ in GOGO + in OGGY]. 


t Here and below, to simplify some particularly cumbersome expressions, we omit the 


suffix in P and denote by numerals 1, 2,... the set of values of the argument X and the spin 
index: 


IH 


È, = PX), P, = P,(X,), 
Giz = Gap(Xis X2), Uy, = U(X- X2), «.- 
+ Such contractions always arise from y operators that appear in the same interaction oper- 
ator Ẹ. Hence Y+ in such terms is always to the left of ý. 
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These four terms are equal in pairs, differing only in the naming of the variables 
of integration Y; and X,. Thus the factor } disappears and the first-order 
correction to the Green’s functions has two fern: 


GY = | UsglinGQGY —GOGNGY] aX, dtX,. (13.6) 


The structure of these terms is conveniently represented graphically by 
means of the Feynman diagrams 


0 25x (13.7) 
{ 4 2 I 3 4 2 


Here the continuous line 4 — 2 denotes the contraction Y, W3 (i.e the function 
iG); the numerals refer to the variables X, and X2 on which the contracted 
operators depend, and the direction of the arrow corresponds to the direction 


from Ë+ to ¥ in the contraction. The contraction Ý+ ¥ of two operators depend- 
ing on the same variables (i.e. the density n) is represented by a loop — a 
closed continuous line. The broken line 3---4 denotes the factor U;,. Integra- 
tion is implied over all variables shown at interior points in the diagram 
(points of intersection of lines). The variables (X; and X2) shown at the exter- 
nal lines of the diagram remain free. 

The first-order terms arising from (13.3) would have diagrams in two sepa- 
rate parts: a straight segment (iG! I, and a diagram with closed loops of con- 
tinuous lines, e.g. 





With an understanding of the method of operator contraction and the structure 
of the corresponding diagrams, we can see the origin of the general rule accord- 
ing to which, in all orders of perturbation theory, the role of the factor ($)-? 
in (12.14) is to take into account only “connected” diagrams with two external 
lines, which contain no detached loops without external lines that are uncon- 
nected to the rest of the diagram by either continuous or broken lines. Cf. 
RQT, §100, for a similar situation in quantum electrodynamics. 

The cancelling of the factor 4 in (13.6) is an instance of a general rule: it is 
not necessary to include (in the mth-order terms) the factor 1/n! from the ex- 
pansion (13.1) or the factor 2~” from the coefficients } in (13.2). The reason is 
that diagrams of order n contain n broken lines i---k. The factor 1/n! cancels 
from the combination of terms differing by interchanges of pairs of numbers 
i, k among all n broken lines. The factor 27” cancels from the interchanges of 
the numbers i, k between the ends of each line. 

The rules of the diagram technique will be finally formulated for the calcu- 
lation of the Green’s function not in the coordinate representation but directly 
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in the momentum representation, which is the most important in physical 
applications. 

The change to the momentum representation is made by means of the Fourier 
expansion (7.21), (7.22), which we write in the four-dimensional formt 


G(X) = | G(P) e™iPX d*P/(2n)4, 


13. 
G(P) = f G(X)eP* atx, i 


where the “4-momentum” P = (a, p), and PX = wt—p.r. We can Similarly 
expand the interaction potential: 


U(X) = 5(t) U(r) = | U(Q)e“2* dOr), (13.9) 


where O = (qo, q); U(Q) is the same as the three-dimensional expansion com- 
ponent, 

U(Q) = U(q) = | Ue ax. (13.10) 
Since U(r) is even, it is clear that U(—q) = U(q). 

Let us make this expansion for the first-order correction Gf? = GO(X1— 
— X2). To do so, we multiply equation (13.6) by exp [iP(X1— X2)] and integrate 
over AX, = Xo). 

In the first term we write 

eP XiX) == el P(X1~Xa) eiP(X3—X:) 


and, changing the variables of integration, obtain 
in f GO(X1— Xs) eP- d4(X,— X3) X 
X GY (X3 — Xo) eP- d(X3— X2) | U(Xs— Xi) d(Xs—X,). 
The first two integrals give GO(P) GQP), and the third is U(0) = l] U(r) dx, 
the value of U(q) for q = 0. 
Similarly, in the second term we write 


ef P(X — Xe) = elP(X1—Xa) ei P(Xa—Xa) ei P(X —X2) 


and, after changing to'integration with respect to X1—X3, X3s—X,, X,—X2, 
obtain 
— GOP) | CRX) U(X) PX dtX.GY(P). 


The remaining integral is expressed in terms of the Fourier components of 
G") and U by means of the formula for the Fourier components of the product 
of two functions’ 


JASOA) PX HX = | f(P1) (P-P) dP: 2r}. (13.11) 


t Though using for convenience a four-dimensional terminology in the discussion and 
notation, we must reiterate that it is entirely unconnected with relativistic invariance. 

t To prove this formula, we must substitute on the left the functions f(X) and g(X) as 
Fourier expansions: 


J F(X) eX eE dX = S AP) g(P2) APP? d4X dP d4P2|(2n)*. 
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Thus the first-order correction to the Green’s functions in the momentum 
representation is finally 


iGY(P) = in® U(0) GQP) GQ(P)— 
— Í GXP) GQ(P1) GRP) Up- pi) dP:/(27). (13.12) 


Each of the two terms in (13.12) corresponds to a particular Feynman dia- 
gram, and this equation may be written 


° 


iG (Pj= | + pai (13.13) 
P ala P P aer 


The points of intersection of lines are called vertices of the diagram. Each 
diagram has 2n vertices, where n is the order of the perturbation theory. At 
each vertex, two continuous lines and one broken line meet. To each con- 
tinuous line is attached its “4-momentum” P in the direction shown by the 
arrow (and the direction of the arrows is unchanged along each continuous 
sequence of such lines). To each broken line is attached a 4-momentum Q; for 
these lines, conventionally any direction of the arrow may be chosen.t The “con- 
servation of 4-momentum” holds at the vertices of the diagram: the sum of the 
4-momentum for the ingoing lines is equal to that for the outgoing lines at each 
vertex. Each vertex also has a particular spin index «. Each diagram has two 
external lines (one ingoing and one outgoing), whose 4-momentum is the argu- 
ment of the required Green’s function iG,,(P); these two lines also have the 
spin indices « and £ of that function. The remaining lines in the diagram are 
called internal lines. 

The analytical form of the terms corresponding to each diagram is deduced 
from the following rules: 

1. Each continuous line between vertices « and £ is associated with the factor 
iGQ(P), and each broken line with the factor —iU(Q). A closed loop with one 
vertex is associated with the factor n(n). 





The integration over d*X is effected by the formula 
efPX dX = (2n)*64(P), 


where the “four-dimensional” delta function 6“ is defined as the product of delta functions 
of the components of the “4-vector” P. The resulting factor 6 (P—P,—P,) is removed by 
integration over d‘P., and we have the right-hand side of (13.11). 

t The “time” components of the 4-vectors Q = (go, q) are in general non-zero, but the 
function U(Q) is independent of qa by the definition (13.10). The arbitrariness of the direction 
of the broken line arises because the function U(—Q) = U(Q) is even. 
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2. At each vertex, the conservation of 4-momentum holds. There is integra- 
tion over d*P/(27)* for the 4-momentum of internal lines that are left indeter- 
minate. At each vertex there is summation over a pair of dummy spin indices, 
one from each of the adjacent G® factors. 

3. The common factor of the diagram in iG,, is (— 1)", where L is the number 
of closed loops of continuous lines with more than one vertex in the diagram. 

This last rule arises as follows. A closed loop with k (> 1) vertices comes 
from the contraction of y operators in the form 


DEPED... WiP 





Here, the contractions equal iG{), ..., iG; ,, and finally —iG{?. For loops 
with one vertex, the correct sign is already obtained by the presence of n 


from rule 1. 
As an example, here is a set of diagrams giving the second-order correction 


to the Green’s functions: 


a ie 
pee ep PEET AR lead So, ee 
(a mee (b) 
= } ; 
S S «4 — J 
(c) ‘Sg (d) O SO 


(e) 
() (13.14) 


i | on -~ 


(f) (g) (h) 
l -~ -~ l 


Lastly, let us return to Wick’s theorem and prove it in the “macroscopic 
limit” (i.e. as V — æ or, equivalently for a given density of the system, as 
N — æ), which is the only important case in statistical applications. 

Let us consider, for example, the averaged product of four y operators, of 
the type 


Phot Gea) = vd Moa Gy, Âp âp EXP (.. .); (13.15) 


1 p? P3 ap, 
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the y operators are in the form (9.3), and the obvious but lengthy exponents 
are omitted. In this sum, the only non-zero terms are those containing equal 
numbers of operators â, and 4+ with the same values of the momenta. They 
include terms in which the momenta are equal in pairs, e.g. pı = p4 and pe = ps. 
These correspond to the paired contraction 


Poy PoP eve 


and are expressed by a sum of the form 


Ta E OAh) Oyen...) 


In the limit V — œ, the summation over p; and pz is replaced by integration 
over V%d8p,d8po/(2x)*; the volume V cancels, and the expression remains 
finite. In the sum (13.15), terms with pı = p2 = p3 = p, are also non-zero; 
they form a sum 


1 
Ta L ately By) exp (. +) 


but after the change to integration one factor 1/V remains, and this expression 
vanishes in the limit V + œ. 

This is clearly a general result: in the limit V + ©, only the results of paired 
contractions are non-zero in the mean value of a product of y operators. 

In the proof given, no essential use has been made of the fact that the aver- 
aging is with respect to the ground state, and it therefore remains valid for 
averaging with respect to any quantum state of the system. 


§ 14. The self-energy function 


The rules of the diagram technique formulated in §13 have an important 
property: the common factor in the diagram is independent of its order. Con- 
sequently, each “figure” in.the diagram has a definite analytical significance, 
whatever the diagram in which it appears, and can be calculated independently 
beforehand. In fact, we can calculate beforehand the sum of several figures 
having a definite number of external lines, and then substitute it as a “block” 
in more complex diagrams. This is one of the chief advantages of the diagram 
technique. 

One such block, which is also of considerable independent importance, is 
the self-energy function.’ In order to arrive at this concept, let us consider all 
the Green’s function diagrams that cannot be separated into two parts joined 
by only one continuous line. These include, for example, the two diagrams of 


t Compare the corresponding definition of the compact self-energy function in quantum 
electrodynamics (RQT, §§100, 102). 
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first-order perturbation theory (13.13), and the second-order diagrams (13.14 
a-f). All have the same type of structure: one factor iG% at each end, and an 
internal part (a function of P), called the self-energy function. The sum of all 
possible such parts is called the exact or complete self-energy function or the 
mass operator; we shall denote it by —i2_,(P). 

All diagrams of the self-energy type give a contribution to the Green’s 
function 


iGQ(P)L—iXp,(P)]iGO(P) = iG (P) E(P) GO (P)b.35 (14.1) 
where we have written G9 = G6,, and also 
Sup P) = 5ep5(P). (14.2) 


The complete Green’s function (represented graphically by a thick continuous 
line) is given by the sum of an infinite series 


re | ——_O-— +=-—OC-(_)+- + redy (14.3) 


where the circles denote exact self-energy functions —iZ,,. Each term in this 
series from the third onwards is a set of diagrams which can be dissected into 
two, three, ... parts joined by one continuous line. 

If we detach from each term of the series (14.3), from the second onwards, 
one circle and the line to its right, the remaining series is again the complete 
function. Thus 


—— = a + +O (14.4) 
Analytically, this is written 
G = G+GZGO (14.5) 
or, dividing by GG, 
: t —2(P). (14.6) 


G(P) GOP) 


We note that the sign of the imaginary part of 2 is the same as that of im G, 
and from (8.14) 
sen im S(, p) = —sgn w. (14.7) 


This follows from (14.6), since the sign of im G7! is the reverse of that of im 
G, and from (9.7) im [G9]? = 0. 

Thus the calculation of G reduces to that of X, which requires the use of a 
smaller number of diagrams. The number can be still further reduced, since 
some of the remaining diagrams can be summed at once in a very simple form. 

Let us select among all the diagrams that determine £ (with a pair interaction 
between particles) those which represent various “offshoots” connected to the 
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external lines by one broken line, and denote their sum by £. All such diagrams 
are present in one skeleton diagram’ of the form 


Da i 9 (14.8) 


The remaining part of X is denoted by +. For example, the following diagrams 
of the first and second orders belong to the first class: 


-f-o 





ata 
(a) (b) (c) 
and to the second class: 
O = aaa + PE E A AEA v + 
t 
(a) (b) sale OT ade 
c) (14.10) 
pe ma 
‘ x i 
Na pa aa O 
(d) (e) 


The thick loop in the diagram (14.8) corresponds to the exact density n(u) 
of the system, just as the thin loop in (13.13a) corresponds to the ideal gas 
density n” (u). It therefore follows from the definition (14.8) that 


—id, = —in(p) U(0). (14.11) 
Thus 
Z = n(p) U(0)+2), (14.12) 
and only the diagrams in 2, need be specially calculated. 
The quasi-particle dispersion relation is given by (8.16). Expressing G there 
in terms of £ by (14.6) and taking G™ from (9.7), we obtain the equation 


1 
G(e— Hy p) 


On the boundary of the Fermi sphere, where p = pp, the energy of the quasi- 
particle is equal to u. Hence we see that 


u—Z(0, pr) = pẸ/2m. (14.14) 


t As in quantum field theory, skeleton diagrams are those made up of thick lines and 
blocks; each such diagram is equivalent to a definite infinite set of ordinary diagrams of various 
orders. 


2 
= e(p)—5— = X(e— p, p). (14.13) 
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The dispersion relation therefore has the form (for p close to pp) 
e(p)— 4 = = (p-pr)+2(e— u, pr)—-E(0, pr). (14.15) 


We emphasize that pp here is the exact value of the limiting momentum for 
a system of interacting particles. It is related by p3/3x? = n to the exact density 


n(u), not to the approximate density n, as in (13.5). 


§ 15. The two-particle Green’s function 


Other important concepts in the diagram technique are reached by consider- 
ing the T product of four Heisenberg y operators, averaged with respect to 


the ground state: 
Koa, = (THE ETS). (15.1) 


This is called the two-particle Green’s function (as distinct from the single- 
particle Green’s function (7.9)). 

To apply perturbation theory and set up the diagram technique, we must 
again change to y operators in the interaction representation. As with the func- 
tion G, this leads to the appearance of the factor $ in the T product: 


Kyu. = G (THs Peres). (15.2) 


In the zero-order approximation (i.e. when § = 1) this expression becomes a 
sum of products of two contractions expressible in terms of G® functions: 


KY 2 = GOGY —GQaG®. (15.3) 


The subsequent discussion of the properties of the two-particle Green’s 
function thus defined will be given in the momentum representation. 

For a homogeneous system, K4 12 in fact depends only on three independent 
differences of the arguments, for example X3—X2, X1—Xe2, Xı— Xə. In the 
momentum representation, this property has the consequence that the Fourier 
component with respect to all the variables Xi, ..., X, contains a delta function: 


= (2m) 6 (P,+P,—P,—P2) Kyo, «p(Ps, P43 Py, P). (15.4) 


This is easily seen by noting that 


PX, + PX — PiX — PX, 
= P3(X3—X2)+Pa(X4—X2)—P1(X1— X2)— Xx P1 +P2—P3— P1), 


t We are again using the simplified notation in which the suffixes 1,2,. .. denote the 4-coor- 
dinates together with the spin index: Xa, X,8, ... ; cf. the second footnote to §13. The full 
notation is shown by 

Ky. 12 = Kye, ap (Xz Xa; X1, Xo). 
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and changing to integration over X3— X4,X4—X2, X1— X2, X2. It may be noted in 
passing that the inverse Fourier transform may be written 


K34, 12 = J Kyo, ap(Ps, Pa; Pi, P3+Pa—Pi) X 


HP, d*P, dP 
xexp {—iLPs(Xs—X2)-+ Pa(X4—X2)—Pi(X1— X2)]} aay . (15.5) 


The function K, ,(Ps, Pa; Pi, Pa) defined in this way will be called the two- 
particle Green’s function in the momentum representation: its arguments are 


related by 
P3+ Po. = P+ Pa. 


In the zero-order approximation, in agreement with (15.3), we have 


KY .p(Ps, Pi; Pi, Po) 
= (2n) (P,— Ps) CP.) GP(P2) — 8 (Pi— Py) CHP) GXP), (15.6) 


i.e. K reduces to a sum of two products of single-particle Green’s functions. 

In higher approximations of perturbation theory, terms appear which amount 
to corrections to these single-particle functions, together with terms that do not 
form products of G functions. This part of the two-particle Green’s function 
is of independent interest. To derive it, we put K in the form 


Kusas, aas (P3, Pa; Pi, Po) 
= (2P (P1 — Ps) G uga (P1) Gara (P2)— 
— 8V (Pa — Pa) Gasal P2) Gara (P1)1+ 
+ Gasba P3) Gaag (PAi a6., p16 P3 Pa; Pi, P2) Gp,0(P1) Gpyag(P2). (15.7) 


The function I’ thus defined is called a vertex function. 

According to the definition (15.1), a two-particle Green’s function in the 
space-time representation is antisymmetric with respect to interchanges of 
arguments (together with the spin suffixes) in the first or second pair: 1 and 2, 
or 3 and 4. Hence we have the analogous symmetry property for the Green’s 
function and the vertex function in the momentum representation: 


I, «p(Ps, Pa; Pi, P2) = — Toy, ap(Pa, Ps; Pi, Pe) 
= — La, pe( Ps, Pa; Po, Pi). (15.8) 


The reason for separating the four G factors in the definition of J’ (the last 
term in (15.7)) becomes clear if we trace back the nature of the diagrams that 
arise when the expression (15.2) for the two-particle Green’s function is ex- 
panded. The analysis below again assumes a pair interaction between particles. 

In the zero-order approximation, the function K is assigned diagrams 


P = Pı P = Pı 


Py = P: P3 = Pe 
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corresponding to the terms in (15.6). In first-order perturbation theory, dia- 
grams appear of the types’ 


representing corrections to each of the factors in (15.6). There also appear 
diagrams that do not separate into two parts: 


R R 
Ree Ramee 
. | + l (15.9) 
Awe E 
g F, 


The four arrows Ph, ..., P, correspond to the four G factors in the last term 
in (15.7), and the internal part of the diagrams determines (in first order) the 
vertex function, the circle on the left of the diagram equation (15.9). Writing 
these diagrams in analytical form, we have 


TY (P3, P,; Pı, Po) ae 5x,5psU(P1 — P3) + 5285yU(P1—Ps). 


The diagrams of higher orders contain corrections of three types: (1) further 
corrections to two unconnected continuous lines, (2) corrections of self-energy 
type to external lines in the diagrams (15.9), (3) corrections forming a figure 
that replaces the broken line in the diagrams (15.9); the sum of all possible 
such figures gives the exact vertex function iJ’. In the graphical representation 
of the two-particle Green’s function by a sum of skeleton diagrams, 


Ps A 
PzP ASR 
+ + (15.10) 
f= FP, Arh 
fa Po 


the thick lines represent exact G functions, and the circle conventionally 
represents the vertex function. 

The calculation of the vertex function in various orders of perturbation 
theory must be made by means of the diagram technique rules formulated 
in §13, and diagrams with four external lines are to be considered (rather than 
those with two as in the calculation of G). Rule (3), which gives the sign of the 
whole diagram, is to be supplemented by the following point; if external lines 


t As for the single-particle Green’s function, the factor (Sy! in the definition (15.2) leads 
to the vanishing of diagrams that contain detached closed loops of continuous lines. 
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1 and 4, and 2 and 3, are joined by sequences of continuous lines (instead of 
1 and 3, and 2 and 4), the sign of the diagram is reversed. 

As an example, the following are all the diagrams that determine the vertex 
function in second-order perturbation theory: 


P Ng 7 
A A 


y i eX /' (15.11) 


(a) 
1 “x 
| + 4} + (3064) 
i OK 
4 2 4 ‘2 


4 2 (d) (e) 
(c) 


+ 


The self-energy function 2' and the vertex function I are not independent; 
they are related by a certain integral equation called Dyson’s equation.” 

To derive this, we use equation (9.5), which is valid (as mentioned in §9) 
even when the interaction of the particles is taken into account. There is a 
difference from the derivation in §9, however, in that the y operator now satis- 
fies equation (7.8). Omitting in the latter the term containing the external! field, 
and substituting from it the derivative 0W/dt, in (9.5), we obtain 


(is Srta) Gaal X1— Xo) — 8r (X1— Xo) 


Or 1 2m 
= —i | (TÊ (X3) U(X, — Xo) ¥ (Xs) dX PAX) PXD 
= —i | Kyz, yp(Xs, X1; Xs, X2) Urad! X3. (15.12) 


This equation solves the problem in principle, since K is expressed in terms of 
I’ by (15.7). We have only to change to the momentum representation. To do 
So, we multiply (15.12) by exp [iP(Xı— Xə)] and integrate over d(Xı— Xə), 
taking Kg) go in the form (15.5) and U43 in the form (13.9). Then the integration 
with respect to 4-coordinates gives delta functions, which are removed by the 
integration with respect to 4-momenta. The result is 


[G-1(P) G(P)—1] dep 


dP, d*P 
= —i | K,s,yp(Ps, Pa; Ps+P4—P, P) U(P— Ps) — 2 —* 


Gas? (15:13) 


with GP) from (9.7). 


+ It is analogous to Dyson’s equation in quantum electrodynamics (see RQT,§104). 
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It now remains to express K in terms of I’. Substituting (15.7) in (15.13), we 
finally obtain Dyson’s equation in the form 


Sxp[GO-4P)—G"XP)] = dap2(P) 
= U(0) n(u) bastides | U(P—P;) G(P1) on 
+ f Dy2,yp(Ps, Pas P3+P4—P, P) G(Ps) G(Ps) G(Ps+Pa—P) X 


4 
x uP-P) 3 (15.14) 


Here n(u) is the exact density of the system as a function of its chemical poten- 
tial; this factor comes from the integration of G by formula (7.24), together 
with the fact that the G function arose from a contraction in which ¥* is to 
the left of Ë. The first term on the right of (15.14) is Z, (14.11). 


§ 16. The relation of the vertex function to the quasi-particle 
scattering amplitude 


The mathematical formalism developed in the preceding sections makes 
possible a rigorous justification and fuller understanding of the significance of 
the fundamental relations in the Landau theory of the Fermi liquid, which have 
been introduced in Chapter I in a partly intuitive form. This topic will be the 
subject of §§16-20.' 

There is a close relation between the vertex function and the mutual scattering 
amplitude of quasi-particles. For a better elucidation of this relation, let us 
consider it first in terms of the purely quantum-mechanical problem of the 
scattering of two particles in a vacuum. 

In quantum mechanics, diagrams with four external lines (two ingoing and 
two outgoing) correspond to a collision of two particles; in the analytical form 
of the diagram, its external lines correspond to the wave function (plane wave) 
amplitudes of free particles (cf. ROT, §103). Let us see how such diagrams of 
different orders in fact give successive terms in the ordinary (non-relativistic) 
Born expansion of the scattering amplitude. 

First of all, in a vacuum many of the diagrams are Zero. This is most simply 
understood in the coordinate representation, since in a vacuum all contractions 
of the form (¥/*+¥) are zero in which the annihilation operator is to the right 
and acts first on the vacuum state; only contractions of the form (WY+) 
remain. Hence all diagrams with closed loops of continuous lines are zero, 
since they always contain a contraction of the form ($+$). For the same 
reason, all corrections to the Green’s function, i.e. to the internal continuous 


t The content of §§16-18 is due to L. D. Landau (1958), and that of §§19 and 20 to L. D. 
Landau and L. P. Pitaevskil (1959). 
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lines in the diagrams, are zero.’ Lastly, diagrams with intersecting broken lines 
are zero; for example, in the diagram 


l 
Ve” 
Pg R 
NOSO A Sennen 
2 I 


(where 1 and 2 denote the arguments t; and fe) the upper internal line cor- 
responds to the contraction (21) = 0 if tə > tı, and the lower line to 
LP,» = Oif te < t. 

Thus, for two particles in a vacuum, there remain only the following dia- 
grams forming a “ladder series”: 


(16.1) 
LP ee crear R Daan mas + 
= í + tt + 1 l E $ee (3m 4) 
P, ~—~— P, pO PONE POO P N beg 


The internal continuous lines here correspond to the vacuum Green’s functions 


—~1 


P, 
(vac) = os Gena . 
G8-)(q, p) [o 5 +i0] (16.2) 


(formula (9.7) with u = 0). It should be noted that (because u is absent from 
the denominator) the pole of this function is always in a particular (the lower) 
complex w half-plane. The vanishing of the diagrams listed above occurs, 
from the mathematical point of view, precisely because all the poles of the 
integrands lie in one half-plane; the vanishing of the integrals is obvious if the 
path of integration is closed in the other half-plane. 

The ladder series (16.1) can be summed by reducing it to an integral equation 
(cf. the summation of the similar series (17.3) below). If the diagrams with 
interchanged external lines 3 and 4 are at first omitted, this equation is equiv- 
alent to Schrédinger’s equation for two particles, ignoring their identity, 
written in the momentum representation; see QM (130.9). Accordingly the 
vertex function is expressed in terms of the scattering amplitude f of the 
two particles by 

Lys, ap(Ps, P,; Pi, Pa) = bxy5p0(47/m) f. (16.3) 


The addition of the diagrams with interchanged external lines 3 and 4 brings 
about the antisymmetrization of the amplitude, as is correct for fermions. 


t The vanishing of all corrections to the Green’s function in the vacuum simply expresses 
the fact that a single particle cannot interact with anything. Here it may be recalled that the 
existence of vacuum corrections to the Green’s function of a particle in the relativistic theory 
is due to the possible occurrence of virtual electron pairs or photons in intermediate states. 
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In the first approximation of perturbation theory, only the first diagram (16.1) 
and the one with interchanged external lines remain; these do not involve 
G^., The expression for the scattering amplitude is then the usual one in the 
first Born approximation. The subsequent diagrams, after the integration over 
intermediate frequencies, give the familiar expressions for the corrections to 
the amplitude in the subsequent Born approximations. 

In a Fermi liquid, the interaction of the colliding particles with the particles 
of the medium causes them to be effectively replaced by quasi-particles. All the 
corrections to the internal lines of the diagram resulting from this interaction 
are automatically taken into account by the definition of the function I. A 
further allowance must, however, be made for the corrections to the external 
lines. In quantum field theory (by virtue of the general requirements of a unitary 
scattering matrix), these corrections are shown to cause a factor ~/Z to appear 
in the scattering amplitude for each free external line, where Zis the renormal- 
ization constant of the Green’s function (see ROT, §107); for diagrams with 
four external lines, this means multiplication by Z?. Although the proof given 
in RQT is valid also for quasi-particles in a Fermi liquid, we shall here explain 
the origin of this factor by simpler (but not rigorous) arguments. 

The Green’s function of a liquid, near its pole (the first term in (10.2)) differs 
from that for an ideal gas only by the factor Z. If ¥ and ¥* are replaced by the 
operators Pu = W/,/Z, yr = W+t/,/Z, the Green’s function Gy, = G/Z 
formed from them will look exactly like that for an ideal gas, near the pole. 
In this sense such operators may be regarded as y operators of an ideal gas of 
quasi-particles. The two-particle Green’s function determined from them is 
K,, = KIZ, and therefore, by the definition (15.7), the vertex part I, = TZ, 
as required. 

In the application to quasi-particles, what is of interest is the number of 
collisions (per unit time and liquid volume) rather than the collision cross- 
section. For collisions with a given change in the momenta and spin compo- 
nents of the particles (pi«, p28 — psy, p,6), this number is 


dW = 27| ZLo, ep(P3, Pa; Pi, Po) |? 5(€3+ €4— €1— £2) X 
X Np, Mp,(1 — Mpa) (1 —MNp,) Bp, Epo dp3/(22)°, (16.4) 


where pi+P2 = p3+p,, and n, is the quasi-particle distribution function. The 
factors n, and n, simply express the fact that the number of collisions of quasi- 
particles with given initial momenta (and spin components) is proportional to 
the numbers of such quasi-particles per unit volume. The factors (1—7,,) 
and (1—n,,) are due to the fact that, in accordance with the Pauli principle, 
a collision can occur only if the final states are unoccupied. 


§ 17 The Vertex Function for Small Momentum Transfers 63 


§ 17. The vertex function for small momentum transfers 


An important role in the theory of Fermi liquids is played by the vertex 
function with almost equal values of the pairs of variables Pi, Ps and Po, P, 
(we shall see, in particular, that it is closely related to the quasi-particle inter- 
action function). Using the relation Pi+P2 = P3+ Py, we put P; = Pit+K, 
P, = Pe—K, and write in simplified notation 


Lys, «p(P1+K, Po—K; Pi, P2) = Tye, ap(K; Pi, Po); (17.1) 


this function will be considered for small K. In terms of quasi-particle scattering 
processes, this means that we consider collisions with a small transfer of 4-mo- 
mentum, which are close to “forward scattering”. 

When K = 0, as we shall see, the function J" has a singularity; we shali be 
interested in the part of the function that contains this singularity. The origin 
of the singularity is easily understood from the skeleton diagram 


(17.2) 


Pi +K P-K 


which includes the set of diagrams of the two-particle Green’s function that 
can be cut between the pairs of external lines Pi, Ps and Po, P, into two parts 
joined by two continuous lines.‘ The two thick joining lines correspond to the 
exact one-particle Green’s functions G(Q) and G(Q+ K), with integration over 
the 4-momentum Q in the diagram. As K — 0, the arguments of these two func- 
tions become closer, and therefore so do their poles. These may “pinch” 
between them the contour of integration (see below), which is the source of 
the singularity in the function I. 

To calculate the exact function l, we must sum the whole perturbation- 
theory series. Since our aim is to separate the part that has a singularity when 
K = 0, we must first distinguish the contribution from all diagrams that cannot 
be cut through pairs of continuous lines having almost equal (differing by K) 
values of the 4-momentum. This part of the function I’, which has no singu- 
larity at K = 0, is denoted by J’; in it we can put K = 0, since it is a function 
only of the variables P, and Pz: I’, .«(P1, P2). The “dangerous” diagrams can 
be classified by the numbers of pairs of lines with almost equal arguments 


t For example, in second-order perturbation theory (with respect to the pair interaction), 
(17.2) contains the diagrams (15.11a, b, c), and (15.1le) with interchanged external lines 
3and 4. 
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which they contain. Thus the total vertex part I" is represented by the following 
infinite ladder series of diagrams: 


iC JOYS om 


P+K B-K 


Here the white circle corresponds to the required iI’, atid the shaded circles 
represent iJ”. The external lines in these diagrams do not entèr into the deter- 
mination of J’, and serve only to indicate the number atid values of the ingoing 
and outgoing 4-momenta. 

All the internal lines in the diagrams (17.3) are thick, i.e. they correspond to 
exact G functions. Here it should be emphasized that the possibility of repre- 
senting I" in the form of these skeleton diagrams (and thefefore all the conclu- 
sions drawn from them) does not presuppose a pair interaction between par- 
ticles, since there are no explicit broken lines, and the nature of the interaction 
actually affects only the internal structure of the blocks represented by circles, 
which is of no interest in this connection.* 

The problem of summing the series (17.3) amounts to the solution of the 
integral equation; to derive this, we “multiply” the whole series by a further I’: 


Comparison with the original series (17.3) gives the equation 


P BE P P P, P 
t o+K 2 \ 2 I 2 
. Bed Š 4 (17.4) 
noe °? Èk RfK Pek PEK PAK 


This diagram equation, when written in analytical form, gives the required in- 
tegral equation 


Lys, ap(K; Py, Po) = Pp, ap(Pis Pa)—i f Lrt. ax(Pi, Q) G(Q+K) G(Q) x 
xX Dea, (K; Q, Pa) d*Q/(27)!. (17.5) 


In accordance with the above discussion, we have put K = 0 in the functions I’, 
used the abbreviated notation for T' and I" previously described, and also put 
Gag = Go.5- 


t Only such general properties as the conservation of particle number are assumed. This 
latter is shown by the constant difference between the number of lines going right and left 
at each cross-section of the diagram (equal to zero for cross-sections of the type shown in 
(17.3)). 
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To investigate this equation, let us first consider the product G(Q+ K)G(Q) 
in its kernel. As already mentioned, for small K the poles of the two factors 
are close together. Near these poles, the G functions are represented by the 
pole terms in (10.2). Denoting the components of the 4-vectors K and Q by 


K= (9, k), Q = (go, q), (17.6) 
we can write in this region 


G(Q) GQ+K) ~ Z2*[go— vr(q—pr)+ ið] t [go+@— vr(lq+k!—pr)+ iô] t, 
(17.7) 


where 6; and 6, are infinitesimal increments whose signs near the poles are given 
by 
sgn 01 = sgn(q—pr), (17.8) 
sgn 62 = sgn(|q+k/|—p). 


The signs of 6; and 62 determine the position of the poles in the upper or 
lower half-plane of the complex variable qo. The singularity in the kernel of 
the integral equation, and therefore in the solution of the equation, arises from 
the pinching of the contour of integration with respect to qo (the real axis) 
between the poles, for which the latter must be on opposite sides of the contour, 
i.e. in opposite half-planes. 

Let us first suppose that q.k > 0, i.e. cos 0 > 0, where 6 is the angle be- 
tween q and k. Then |q+k! > q, and ô, and ô» have opposite signs (6, < 0, 
62 > 0) if q < pr, |q+k|> pr, which, in view of the smallness of k, is equiv- 
alent to the conditions 

pr—k cos 6 < q < pr. (17.9) 


In the subsequent integration with respect to qo in (17.5), the contour of inte- 
gration may be replaced by an infinite semicircle in either the upper or the lower 
half-plane; the integral is then given by the residue of the integrand at the cor- 
responding pole. Because of the narrowness of the range (17.9) when k is small, 
we can take k = 0 in the factors I’ and Í in the integrand, and similarly 
go ~ 0 for the position of the poles when k and w are small. 

In other words, as regards its role in the kernel of the integral equation 
(17.5), the product of pole factors (17.7) is equivalent to the delta functions 
A6(qo) 6(¢— Pr), with a coefficient A given by 


Gets Z? dqo dq 
J [go— vr(q—pr)+ iôi] [go +0 ~ vr(q+k|—pr)+ iô] 


When q is outside the range (17.9), both poles lie in the same half-plane of 
complex qo; when the contour of integration with respect to qo is completed in 
the other half-plane, we see that the integral] jis zero. In the range (17.9), 
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completing the contour in one half-plane and calculating the integral from the 
residue at the pole in that half-plane, we find 
Ais [ 2niZ? dq 
~ | @—vp(\qt+k! —q)+i0 ’ 


where we have used the fact that 6, < 0 and 62 > 0 in the range (17.9). Since, 
by (17.9), q ~ pp > k, we can put |q+k!—q ~ k cos 9, and then, with the 
limits given by (17.9), 

A = 2niZ*k cos 0/(w—kvs cos 9). 


It is easy to prove by the same method that a similar expression for A (but 
with the opposite sign of i0) is obtained when cos 0 < 0 (when the integration 
is to be taken over the range q > pr, |q+k! < pr). Thus we have in the kernel 
of (17.5) 
2niZ1.k5(qo) 6(q—Pr) 


o odktOsmo 2> (17.10) 


G(Q) G(Q +K) = 
where I.k is written in place of k cos 0 (l = q/q), and the function ¢ has (when 
K is small) no delta-function part, and we can therefore put in it K = 0. 

Substituting (17.10) in (17.5), we get the basic integral equation in the form 


Ty, «a(K3 P,, Py) E Lys, a8 Pis P2) 
—i f Le.a(Pu 0) A(O), eo(Ki Q, Ps) PONa) 


2 
Mi | Pre wx(P1, Or) F'xs, eo K; Or, P2) en (17.11) 


In the last term we have written d4Q = q? dq do, dqo (where do, is the element 
of solid angle in the direction of I) and have removed the delta functions by 
integration over dq dqo. In this term the argument Q in the functions I’ and I’ is 
taken on the Fermi surface: Qp = (0, prl). 

The factor L.k/(@—v,1.k) in the kernel of (17.11) has a specific property: 
its limit ask — 0 and œ — 0 depends on the limit of the ratio w/k. The solution 
of the equation must therefore have the same property: the limit of the function 
T(K; Pı, P2) as K - 0 depends on the way in which w and k tend to zero. 

Let °(P, P2) denote the limit 


DP y6(P1, Po) = lim Py, .p(K; Pi, Pe) for k/o — 0; (17.12) 
K-»0 
we shall see in §18 that the quasi-particle interaction function is related to 


this quantity. With that method of taking the limit, the kernel of the last integral 
term in (17.11) is zero, and so I satisfies the equation 


To, ap P15 Po) = Dy9,x6(P 1, P)—i f Dye, al Pi, Q) p(Q) Tò, ca(Q, Po) dO/(22)4. 
(17.13) 
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Because of (15.8), 
Des, «p(P1, Po) = U5, pa(P2, P1). (17.14) 


We can eliminate I from the two equations (17.11) and (17.13). The result is 


Ly, ap(K; Pi, P) = = T9, ap(P1, Po) 


Zp} 


ra” vt, ax(P1s Or) xo, ep K; Or, P) -Zok ie 


Taik 0719 


since, if we formally write (17.13) as Ž = LI, (17.11) becomes 


Zp} Lkdo; 
irott f rr tke 


Substituting here [ = LI and applying the operator L~! to both sides of the 
equation, we get (17.15). 
We now define the function I* by 


Is, 2p(Pi, Pe) = lim Tye, 29(K3 Pi, P2) with o/k +0. (17.16) 
K—0 


This function (multiplied by Z?) is the forward scattering amplitude (i.e. that 
of the transition P1, P2 — Pı, P2), corresponding to actual physical processes 
undergone by quasi-particles at the Fermi surface: collisions that leave the 
quasi-particles on that surface are accompanied by a change of momentum 
without change of energy, and therefore the passage to the limit of zero mo- 
mentum transfer (k — 0) must be made with exactly zero energy transfer 
(œw = 0). The function [ defined above corresponds to the non-physical limit 
of “scattering” with a small energy transfer and exactly zero momentum trans- 
fer (k = 0). 

Putting w = 0 in (17.15), taking the limit k — 0 and multiplying both sides 
by Z7, we get 

TS, ap( Pi, P2) = = ZT, ap( Pi, Po) 


E bf Th API QF) ZT, lOr, Pa)dor. (1717) 


Thus there is a general relation between the two limiting forms of the forward 
scattering amplitude. 

The antisymmetry properties (15.8) for T' give some information about the 
behaviour of I“ and I” when P, > Ps. Putting P, = Pz and « = £ in this 
equation, we get 

I,s,«(P1+K, Pi—K; P1, Pi) = 0; (17.18) 


there is no summation with respect to « here.‘ The transition to T® or I* 
t When only the exchange interaction between the quasi-particle spins is taken into account, 


the only non-zero lL’, «a are the Myx, aa. This expresses the constancy of the spin vector in scat- 
tering, and may be verified directly from an expression of the type (2.4). 
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in this equation is to be made with caution, since in the latter functions we first 
put K = 0, but in (17.18) we first put P, = Pe. 

Let K and P;—P2 = S = (so, s) be simultaneously small. Then, as well as 
the diagrams (17.2), the diagrams 


B-K Pee 


are dangerous. When K and S — 0, the function I’, ,, will therefore depend on 
the two “singular” arguments 


x=olk, y = (Soto@)/|st+kl, 


and (17.18) shows that this function is zero when x = y. We shall consider 
the values of on the Fermi surface; then wœ = so = 0, and so y = 0. Hence, 
in this limit, (17.18) is valid only if also x = 0. Thus, on the Fermi surface it is 
valid for J*: 

I¥s,ax(P1, P1) = 0 (17.19) 


(N. D. Mermin 1967). 


§ 18. The relation of the vertex function to the quasi-particle interaction 
function 


Just as intermediate states with particle numbers N+1 are involved in the 
matrix element (7.9), which determines the one-particle Green’s function, so 
intermediate states with N, N41 and N+2 particles are involved in the matrix 
element (15.1) of the two-particle Green’s function." 

Because of the presence of intermediate states with N+1 particles, the two- 
particle Green’s function has poles which coincide with those of the function G, 
i.e. with the quasi-particle energy. The corresponding factors are, however, 
Shown explicitly in (15.7). Hence the vertex function I’ defined by this formula 
only has poles corresponding to states with N or N+2 particles. The angular 
momentum of these states differs by 0 or 1 from that of the ground state, and 
so the elementary excitations corresponding to these poles have integral spin 
(0 or 1) and hence obey Bose statistics. Thus the poles of the vertex function 
determine the Bose branches of the energy spectrum of a Fermi liquid. 


t States with N particles arise with such a sequence of operators in the T product as, for 
example, P, Pt P, 2. States with N+ 2 particles correspond to such sequences as Y PY, Pt MP. 
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The poles arising from the intermediate states without change of particle 
number correspond to elementary excitations that represent zero sound quanta. 
In the diagram technique, intermediate states correspond to different cross- 
sections that divide the diagrams into two parts between various external lines.. 
In the present case, intermediate states without change of particle number 
correspond to cross-sections of the diagrams (17.3) at one of the pairs of 
continuous lines joining adjacent blocks I’; the constancy of particle number in 
these states is expressed by the equal numbers of lines passing in each direction 
through the cross-section. The 4-momentum transfer through such a cross- 
section is (0+K)—Q = K; accordingly, the elementary excitations without 
change of particle number correspond to poles of the vertex function T(K; 
Pı, P2) with respect to the variable K. 

We have seen previously, in the derivation of (17.10), that one of the two 
momenta q and q+k (which appear in the 4-vectors Q and Q+ K) must be 
greater than the limiting momentum pp, and the other must be less. On the 
other hand, in excitation from the ground state, only “particles” can be outside 
the Fermi sphere, and only “holes” within it. In this sense, we can say that the 
zero-point excitations in a Fermi liquid may be regarded as particle-hole bound 
states.’ 

Elementary excitations corresponding to intermediate states with N+2 par- 
ticles (and to the poles of the function I'(K; Pi, P2) with respect to the variable 
P,+P:) could be regarded as bound states of two particles or two holes. 
The presence of such states would, however, lead (as will be shown in Chap- 
ter V) to superfluidity of the Fermi liquid, and this in turn necessitates a consid- 
erable change in the whole mathematical formalism of the diagram technique. 

Thus, to determine the Bose branch of the energy spectrum of a non-super- 
fluid Fermi liquid, we must examine the poles of the vertex function T(K; 
Pı, P2) with respect to the variable K = (w,k). For each value of k, a partic- 
ular energy w(k) corresponds to the pole, and the dispersion relation for these 
excitations is thereby determined. For weakly excited states, œ and k are small, 
so that we can use the equations derived for the function T(K; Pı, P2) in 
the range of small K. 

Near a pole of I’, the left-hand side and the integral on the right-hand side 
of (17.15) become arbitrarily large; the term I(P1, P2) remains finite, and 
may therefore be omitted. Moreover, the variable P> and the suffixes 6 and 
ô are unaffected by the operations on I’ in (17.15), and so they act as unimpor- 
tant parameters in that equation. Lastly, we shall consider the function I’ on 
the surface of the Fermi sphere, i.e. we shall put P, = (0, ppn), where n is a 
variable unit vector. From all these facts we conclude that the determination of 


t In this formulation, the problem is formally very similar to that of determining the elec- 
tron-positron bound state levels in quantum electrodynamics (see ROT, §122). In particular, 
equations (17.4) and (17.5) are analogous to the Bethe-Salpeter equation, RQT (122.10), 
(122,11). 
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the acoustic excitations in a Fermi liquid reduces to the problem of finding the 
eigenvalues of the integral equation 


Lkdo; 


w— vrl.k’ (Sl 


2 
y(t) = a f 2%, cult, D Ze) 


where y,,,(m) is an auxiliary function. 
This equation may be transformed by replacing y by another function 


v„a(0) = aes Lya(M). (18.2) 


Equation (18.1) then becomes 


(@— vpn.k) ?y.(0) = yt, «xCD, D) »,e(0’) do’, (18.3) 


k.n ZEL ne f yes 
with n’ in place of I. 

This equation has exactly the same form as the transport equation (4.10) 
for the vibrations of a Fermi liquid. Comparison of the two equations gives 
the following correlation between the quasi-particle interaction function and 
the function I’: 


Sys, apl Pru, pra’) = ZY, «p(n, n’). (18.4) 


This shows the relation between the function f and the properties of quasi- 
particle scattering.’ 

Equation (18.4) relates f to the non-physical scattering amplitude. We now 
use (17.17) to obtain an explicit relation between f and the “physical” forward 
scattering amplitude for quasi-particles on the Fermi surface, which we denote 
by 

Ays, «pl, n2) = ZI% «pM, n2). (18.5) 


The relation (17.17) on the Fermi surface is 
Ays. «ap(Dı, n2) 


P? j , n do' 
= fys, «pD, Me) — Ifor Jyt, ax(R1, 0") Axs, p(n’, Me) az: (18.6) 


The spin dependence of the functions A and f can be expressed by means 
of the Pauli matrices o. In the general case, these functions may contain any 
scalar combinations of the four vectors nı, Me, 61, 62. If there is an exchange 


+ The above general proof is due to L. D. Landau (1958). For a slightly non-ideal Fermi 
gas, the derivation of the transport equation by summation of specific diagrams of the type 
(17.3) was earlier given by A. B. Migdal and V. M. Galitskii (1958). For a gas, the G functions 
(in the zero-order approximation) contain only pole terms, and so the exclusion of non-pole 
terms does not arise. 
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interaction between the particles, the only permissible scalar products are 
üı.B2 and 61.62. The functions Aand f can then be written (as with fin (2.4)) as 


Ta fyd, apl, n2) = F(A) 8x, 5g0+ C(O) O,a+Sags 
F (18.7) 


a Aya, ap(ı, n2) = BA) 5x00 + C(@) Gyx-898, 
where the coefficients F, G, B, C, are functions only of the angle # between 
n, and nz. They may be expanded in series of Legendre polynomials: 


B(8) = Ş (2/+1) ByPi(cos 8), ... (18.8) 


i=0 


Substituting (18.7) and (18.8) in (18.6), and calculating the integral with the 
addition theorem for Legendre polynomials, we get 


Bı = F(1—B), Cr = G(1-C)). (18.9) 


These formulae establish a simple algebraic relation between the expansion 
coefficients of fand A. 
The stability conditions (2.19) and (2.20) give similar inequalities for the 
coefficients B, and C;: 
Bi< 1, C, <1. (18.10) 


Moreover, these coefficients satisfy a relation that follows from (17.19): B(0) + 
+ C(O) = 0, or 


y (2/+ 1) (Bi+C)) = 0. (18.11) 
i=0 


Equations (18.9) and (18.11) together with the conditions (18.10) are sufficient 
to prove an interesting theorem: in every stable Fermi liquid, there is at least 
one branch (ordinary or spin) of axially symmetric zero sound.‘ 


§ 19. Identities for derivatives of the Green’s function 


In the mathematical formalism of Green’s functions, an important part is 
played by certain identical relations between the derivatives of these functions 
and the quasi-particle scattering amplitude. These relations are all derived in 
the same way by calculating the change in the Green’s functions caused by some 
fictitious “external field” for which the result of its action on the system is 
already known. 


t See N. D. Mermin, Physical Review 159, 161, 1967. 
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First of all, then, let us calculate the change ôG in the Green’s function 
caused by an arbitrary “external field”, for which the corresponding term in 
the Hamiltonian is 


SPD = f Plt, r) 60%(1,r) dx, (19.1) 


where ôÛ is some operator acting on functions of r (and possibly depending 


on the time 1). 

When the external field is present, the Green’s function depends on the two 
4-momenta P, and Pz. In the diagram technique, such a field is represented by 
a new graphical feature, an external broken line: 


Rondi E 
and this line is associated with a factor 
—iôU(Pa, P,) = —if e'P2* 5Oe-!P:* d4X, (19.2) 


In the first order with respect to the external field, the correction to the exact 
Green’s function is represented by a sum of two skeleton diagrams: 
P 
| - 
IBIR, A) = pamm] an P, + —— (19.3) 
Q 
P, ! 
where all the continuous lines are thick (exact G functions) and the circle is an 
exact vertex function (il). In analytical form, this equation is 


IG pa( Po, P) = Gz,(P2) dbU(P 29 P 1) Gy(P. 1)— 
—iG5,(P2) Gea(P1) f To, (P2, Q1; Pi, Q2) X 
X SU (Q2, Q1) Gex(Q2) Gua(Q1) d'O (27), (19.4) 


with Q+ Pı = Pot Q1. 

The first two identities to be considered are due to the conservation of the 
number of particles in the system. In the Hamiltonian of the system, this prop- 
erty is expressed by the appearance of the y operators in pairs: one ¥*+(X) 
and one Y/(X) for each argument X. 

We apply a gauge transformation to the » operators: 


WX) = DUX), Pi = Wi t+ em, (19.5) 


where ¥(X) is a real function.’ From the above-mentioned property of the 


t This is analogous to the gauge transformation in quantum electrodynamics; cf. QM 
(111.2)-(111.9). 
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Hamiltonian, if Ý satisfies the “Schrödinger equation” (7.8), $” satisfies a sim- 
ilar equation with the changes 

ð ð 21 
> —j 2 —— oo 
A (VIN, op OE aE 


For an infinitesimal y = ôy, this change in the equation is equivalent to adding 
to the Hamiltonian an “external field” 


80 = _ 06% 


zar + (aðr +2vôz. V). 


In particular, if 
ôx(X) = re yoe™'®*¥, K= (w, k), 


where the symbol re can in fact be omitted, since the subsequent operations 
are linear, we have 


1 
ôU (Pe, Pı) = i(2x)4 40d) P2—Pi—K) fo BA k. (pı -+ Pat . (19.6) 


On the other hand, the Green’s function constructed from the » operators 
Yi =Wi+idy), Wit = PF idx) 
differs from that constructed from Y and Ý+ by 
8Gup(X1, X2) = iGup(X1 —X2)[64(X1) — 6y(X2)] 
or, in Fourier components, 
5Gep( Po, P1) = f Gael Xi, Xo) elP1%—-PiXD dtX, d*Xe 
= i[Gup(P1) — G.p(P2)] ôx(P2— P1), (19.7) 

where 


by(P) = f dx(X)elP* dX = (27) xo(P— K). 


Thus the same change 6G,, has been expressed in the two forms (19.7) and 
(19.4), where ôU is to be substituted from (19.6). Equating these two expres- 
sions, we get, after putting G,, = Géb,, and renaming some of the variables, 

k.(2p+k) 
gay 


dul G(P-+ K)— G(P)] = GIP +K) GCP) {|- ee 


i k.(2q—k)] dQ 
t (Tuk: P, 04) GO- K) [o-a] EE. 
The required identities are obtained by taking the limit of this equation as 
w + 0,k — 0; then 
oG 0G 


G(P + K)—G(P) + œw — +k. ; 19.8 
(P+K)—G(P) + oF +k. 5 (19.8) 
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where P = (po, p). Taking the limit with the condition k/w > 0, we get the 

first identity: 

OG(P) _ 
OPo 





7 (Px [burt (Thal, O ye 099 


with the notation 
{GAP)uo = lim G(P)G(P+K), k/w — 0. (19.10) 
w, k — 0 


Similarly, taking the limit with the condition w/k + 0, we get a second 
identity: 
0G 


Oup ap 


= P 3i q 
= (PP [E bagi frh, AP, 0) {GQ} | (19.11) 
with the corresponding notation {G?(P)},. 

Next, let us consider the change in the Green’s function when a constant 


field 
80 = U(r) = Ue -" (19.12) 


is applied to the system. When k — 0, this field varies slowly in space, and so 
its influence on the system can be treated macroscopically. According to the 
thermodynamic condition of equilibrium in an external field, we must have 
u+ôU = constant (see Part 1,§25); when k - 0, this means that the chemical 
potential u changes by the small amount — Uo. The corresponding change in 
the Green’s function is 


6G. p( Xi, Xo) = — Uoô«g 0G(X, —X,)/Opu, 
and its Fourier component, defined as in (19.7), is 
bGap(Po, Pı) = = (22) SHP, — Py )U oa p0 GCP: )/ eu. 


The same change in the Green’s function can also be calculated from equa- 
tion (19.4), this time with 


bU( Ps, P,) = (27) U8(P,—P,-K) (K = 0,k). 
The passage to the limit k ~ 0 in this case (constant field, œ = 0) corresponds 
to the case w/k — 0. This gives the identity 


0G(P) 


ba 5 





= (GP) [ei f Thy aP, ONGO) | _ (19.13) 


Lastly, one more identity results from the Galilean invariance of the system. 
To derive it, let us consider the liquid in a system of coordinates moving with 
a small velocity dw(t) = woe~‘ that varies slowly with time. The change to 
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such coordinates is equivalent to the imposition of an external field whose 
operator ist 


60 = —dw. p = iôw. y (19.14) 


or, in the momentum representation, 


8U( Ps, P) = — pi- Wo(27)*8(P,—P,—K), K = (w, 0). 


This expression is to be substituted in (19.4), and then the limit w — 0 is to 
be taken. 

When w — 0, we have a Galilean transformation from one inertial frame 
of reference to another moving with constant velocity dw. If there is in the 
liquid an elementary excitation with energy e(p), its energy in the frame mov- 
ing relative to the liquid with velocity ôw is e—p.dw.* Hence, in the new frame, 
the frequency po must appear in the function G(P) as po+p.ôw (so that the 
pole of the function is shifted by —p. ôw). Then 


ôG = p.dw0G/dpp, 


and we arrive at the identity 


i OG(P 
p Po 


= —{GP)}, {ene f Tp «a P, 0) {CO} | 09.15) 





We shall need to use these identities, in particular, for values of the free 
variable P = (po, p) on the Fermi surface: Pp = (0, pp). Transferring the factor 
G°*(P) from the right-hand to the left-hand sides, we replace the derivatives of 
G(P) there by those of G~1(P); the way in which the limit K — 0 is taken in 
G(P)G(P+ K) is unimportant. 

Near the Fermi surface, the Green’s function is determined by the pole 
term, so that 


G-\(P) = 5 [Po— vr (p —prF)]. 


Hence, on the surface itself, 








t In the classical Lagrangian of a free particle, L = imv*, the change to moving coordi- 
nates is effected by substituting v— v+ ôw, and gives an increment ôL = mv.6w which is small 
if ôw is small. Accordingly (cf. Mechanics (40.7)) the increment of the Hamilton’s function is 
6H = —p.ôw, and in quantum mechanics this corresponds to the operator (19.14). 

t See the more detailed discussion in §23. 
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Consequently, for example, the identities (19.9) and (19.13), on the Fermi sur- 
face, take the form 


‘| Th, al Pr, OGO or = (1 -z) Sap, (19.16) 
‘| Th, Pes NOM Boor = (1-F FF) dep. 19.17) 


§ 20. Derivation of the relation between the limiting momentum 
and the density 


The relations derived in the preceding sections provide a consistent proof of 
the fundamental proposition in the Landau theory of the Fermi liquid: the 
relation between the limiting momentum p,and the density N/V of the liquid 
is given by the same formula (1.1) as for an ideal gas. 

The idea of the proof is to calculate independently the changes in N and pp 
due to an infinitesimal change in the chemical potential u, and then to com- 
pare them. 

According to (7.24), the total number of particles in a given volume V, as 
a function of the chemical potential, is given by the integral 





t — -—0 


4 
N = —2iV lim [on e~!pot = ; P = (Do, P). (20.1) 


Hence the derivative 





1 dN. f aG(P) dP (20.2) 


V du ðu (20° 


Since this integral converges for large po (0G/ðu oc 1/p% when | po! — œ), the 
factor e~'?' may be omitted from the integrand. After substitution of 6G/Ou 
from the identity (19.13) summed over « = £, we find 


1 dN : 
ar a f (GP)}x a + f {GAPI TP, OGO} ae 


where we have put "= Iy, « for brevity. The object of the calculation is now 
to express the right-hand side of this equation in terms of an integral over the 
Fermi surface only. 
First, we replace J“ in the second integral from (17.17), with Spin place of 
Qr: 
y a = Y SEP Gort ETE, NOOK Goer 


PZ fig rm, d'P d'O do 
T f {GAPI S, P, SPATE, (Sr, OG (Dh n — . (20.3) 
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We begin by transforming the last term. In the integrand, only the last two 
factors depend on Q; the integral of these over d*Q is given (on the Fermi 
surface, S = Sp) by (19.17), and this term therefore becomes 


PZ? d*P d 
i ae | {GAP} T(P, Sr) Lar (1-3): 


Next, we note that in the integration over d‘P the limiting values of G(P) G(P+ 
+ K) are to be taken in the form ue hence {G?(P)}, = $(P), and 





{GX P)}x = (GP) (po) (p—Pr. (20.4) 
This gives 
¡PeZ d d‘P d 
ar (1-7 = a H {GA P) T(P, Sr) ae —8xiF ; 


where, in accordance with co we have used the quasi-particle interaction 
function and expressed f,, ,- in terms of F(@) by (2.6), (2.7); the bar over F 
denotes integration over do/4x. The remaining integral over d‘P is given by 
(19.16), and the integration over do, gives a further factor of 4. Thus the third 
term in (20.3) is equal to 


P(e ae) at (20.5) 


The second term in (20.3) is transformed similarly: the quantities {G?(P)},, and 
{G?(Q)},, are expressed in terms of {G?(P)},, and {G?(Q)},, by (20.4), and the 
identities (19.9) and (19.16) are then used. This term is then found to be equal to 


oG d‘P pZ? 1 
-2 | 5 Crt -2i | (PO Ge Gaj + oat fz (z-1)-FI . (20.6) 


The first integral gives zero on integration with respect to po, since G + 0 
when po > to. 
Lastly, the first term in (20.3), with the substitution of (20.4), becomes 


2 
2 fte *(D)}o Ta Sart ype . (20.7) 


vrn? 
Adding the contributions (20.5)— (20.7), we get 


1 dN _ pe dpr _ peZ apr 
V de a dat 7Up E vr(l +P. (20.8) 





On the other hand, by putting 
= (@n'/Cpp) Opp = O(p' —Pr) OPr 
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in (2.14), we easily find 
du/dpr = vp(l +F). (20.9) 


It should be emphasized that, in the derivation of (2.14), no specific depend- 
ence of pron N/V has been assumed, and we can therefore use this relation 
to find that dependence; equation (20.9) can, of course, also be obtained with 
the aid of the same relations for the vertex functions as were used in deriving 
(20.8).* 

From this equation we see that the quantity in the braces in (20.8) is zero, 


and so 
d [N 2 d d | 8xp3 
N ay Lc gia mel (20.10) 
du\V n du du | 3(2z) 


When N/V — 0 we have a gas, and in this limit the dependence of ppon N/V 
must therefore be the same as for a gas. This determines the constant in the 
integration of (20.10), and we have finally the required relation (1.1): 


N/V = 82p?,/3(22). 


§ 21. Green’s function of an almost ideal Fermi gas 


To illustrate the application of the diagram technique, we shall calculate in 
this section the Green’s function of an almost ideal Fermi gas in the model 
discussed in §6 by means of ordinary perturbation theory (V. M. Galitskii 
1958). The gas, it will be recalled, has repulsion between the particles, and the 
device described in §6 allows us to apply perturbation theory to this interaction, 
provided that the final result involves only the scattering amplitude. 

As shown in §14, the determination of the Green’s function reduces to the 
calculation of the self-energy function 2,,(P). In first- and second-order per- 
turbation theory, it is given by the set of diagrams (14.9) and (14.10). These 
may be put in the form 


P-Q 
=) - P lo + + 
(a) PE n 
a 
(21.1) 
t I 1 + l | 
tot 1 i 
P Paala p 


(c) (d) 


t Formula (2.11) for the effective mass can be deduced from the relation (17.17) and the 
identities (19.11) and (19.15). 
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The diagrams (21.1a, b) include the first-order diagrams (14.10a) and (14.9a) 
and the second-order diagrams (14.10b, c) and (14.9b, c); the latter differ from 
the former only by corrections to the internal continuous line. These lines are 
shown thick in (21.1a, b) and must therefore be correlated not with the ideal- 
gas Green’s functions G® but with the functions G corrected as far as first- 
order terms. The diagrams (21.1c, d) are the second-order diagrams (14.104, e). 
All the diagrams have been subjected to a deformation in order to clarify 
their structure; they are the first terms in a “ladder” series of diagrams with 
four external lines, in each of which a pair of external lines have been “short- 
circuited” in two different ways. 
We first calculate the diagram (21.1a). Its analytical expression is 


[-i2(P)]. = f U(Q) G(P-Q) d*Q/(2x), 
Q=(%,9), P = (w, p); 


the common factor 6,, is omitted. We first integrate over qo. Since the factor 
U(Q) = U(q) does not depend on qo, and G oc 1/go when :go, > œ, the manner 
of integration has to be more precisely described. For this, we must go back 
to the origin of the diagram (21.1a), and note that the continuous line there 
corresponds to the contraction of a pair of y operators arising from one 
operator V. This means that Y and ¥ are taken at the same instant, and $- 
is to the left of Ý in the contraction. That is to say, in the coordinate representa- 
tion the G function occurring is taken for t = tı— t + —0. In the momentum 
representation, this means including a factor exp (—/qgof) in the integrand of 
(21.2) and taking the limit as t>—0. Now using (7.23), we find 


[—iZ]. = i | U@)N@—4) d?q/ (27), (21.3) 


where N(p) is the particle distribution function. 

The Fourier component U(q) depends markedly on q only when q Z 1/ro, where 
ro is the range of action of the field U(r); these values are certainly large (for a 
rarefied gas) in comparison with pp. If we consider only values | p—prp « 1/ro. 
then for these values of q we have N(p—q) ~ 0. Hence U(q) in (21.3) may be 
replaced by U(0) and taken outside the integral." The remaining integra! is half 
(because of the specified value of the spin component) the gas density n(u): 


[Z] = — nlu) U(0). 


The diagram (21.1b), with the closed continuous line, gives [2], = n(u) U(0). 
Thus the contribution to X from the two diagrams is 


(21.2) 


[2'Ja,6 = 3(u) UO) = (22/m) n(u)a, (21.4) 
where a is the scattering length defined by (6.2). 


t The resulting error is easily seen to be of the relative order of magnitude ~ (pprp), and 
therefore has no effect even on the terms of the next order in p pro. 
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The expression (21.4) includes, in particular, the whole of the first-order 
effect. In this approximation, n() is to be understood as the ideal gas density 


n(u), so that 
ZO = [Z], = (20 /m) n©(u) a. (21.5) 


For the subsequent calculation we define an auxiliary function F given by the 
ladder diagrams 


Psy t——— P a Ea e atmam 
me a nh ae, a (21.6) 
(as usual, Pı+P2 = P3+ Ps). In analytical form, 
iF ya, ap(Ps, Pa; Py, Po) = 18xy0pe(F + FO), (21.7) 
where 
iF® = —iU(P;—P)), (21.8) 


iF® = | GO(P’) U(P,—P')GO(P, + P,—P’) U(P! — P,) d*P' N27). (21.9) 


Expanding both diagrams (21.Ic), (21.1d) and expressing them in terms of 
F®, we obtain 
[-iZ(P)],a = — | GQ) F@(P, Q; Q, P)d*Q/(2n)* 
+2 | GQ) FOP, Q; P, O)d*O/(2n)'; (21.10) 

the same integrals with F™ in place of F® give (21.5). The difference in sign 
between the two integrals is due to the presence of the closed loop in the dia- 
gram (21.1d); the delta factors in the first diagram give 6,,6,, = 5,s, and those 
in the second diagram 6,,6,,, = 28,.. 

Let us now calculate F. Since U(Q) is independent of go, the integration 
with respect to p, reduces to 


| GO(P) GO(P, + P)—P') dpi/2n. 


Substituting here G from (9.9) and using the convergence of the integral for 
' poal — œ, we close the contour of integration by an infinite semicircle in one 
half-plane of the complex variable p,; the integral is zero unless the poles of 
the two functions G lie in different half-planes, i.e. 


sgn (p’ — pr) = sgn(|P,+P2,—P'|—Pr). (21.11) 
The result is 
FO(Ps, Py; Py, Po) 
~ | —— Vm- Up -psen -pr dP 


7 1 12 12 . r (27)? : 
Oy + w+ 2u — -yr [P+ (Pi + P2— P’)”] + 10. sgn (p' — pr) 


(21.12) 
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where w = P10, œz = p20. In order to satisfy the condition (21.11) automatic- 
ally, we must substitute in the numerator of the integrand 


sgn(p’—pr) — 1—4(p’)—9(1+P2—P’), 


where @(p) is the step function (1.10). 

We have seen in §16 that a sequence of ladder diagrams determines (in a 
vacuum) the mutual scattering amplitude of two particles. Hence the expression 
(21.12) contains the correction to the first-order terms in the scattering ampli- 
tude. This correction can be taken into account by substituting in F™ (21.8) 


U(ps—P1) — — (4/m) re f (Pz, P1). 


where f is the scattering amplitude? in a vacuum, correct to the second order, 
and at the same time subtracting from the expression F (21.12) the real part 
of its value in a vacuum, i.e. for pp = 0, u = 0, and the values w1 = p;/2m, 
wz = p2/2m corresponding to the energies of two real colliding particles (the 
“physical” external lines of the diagrams). We can then replace —re f by the 
value for zero energy, i.e. the scattering length a.* We thus have 


FO Py, Py; Py, Pe) 
--(4 2 ma —F(Ps + PoP) 
a w+ y+ 2-5 [p++ pA] + 10.580 (ppr) 


2m d3p’ 
P AAP- it PP | F ia 
The symbol P in the second term means that the integral is taken as a principal 
value; this is the result of separating the real part of the integral by means of 
the rule (8.11). 
Since the expression (21.13) is symmetrical in P, and P2, the two integrals 
in (21.10) are the same, and 


[-i2(P)]e.a = | GQ) FOP, Q; P, Q) d'Or). 
When the first term from (21.13) is substituted, the integral with respect to go 


is non-zero if 
sgn (p’ — pr) = —sgn(q—pr), (21.14) 


t Not to beconfused with the quasi-particle interaction function. 

+ This replacement could not be made in (21.12), since it would cause the integral to diverge 
for large p’. After the subtraction mentioned, the integral converges (for p’ ~ py) even with 
this replacement, which is therefore feasible. The subtraction of only the real part of the inte- 
gral (and accordingly the replacement of U by re f) is done in order to avoid a difficulty con- 
cerning the imaginary part of the scattering amplitude. The reason is that, for small momenta, 
re f is expanded in even powers of the momentum and im f in odd powers (see QM, §132). 
Hence the inclusion of the momentum dependence of f would lead to corrections of relative 
order (pya)*, which are negligible. The substitution U — —4//m, however, would mean 
taking into account the imaginary part of f, which brings in corrections of relative order p pa. 
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so that the two poles of the integrand are again in different half-planes of qo- 
When the second term from (21.13) is substituted, only the factor Go(Q) 
depends on qo; the integration with respect to go is carried out by means of 
formula (7.23) and gives N(q), the particle distribution function in an ideal 
gas, i.e. the step function 6(q). The result, when the contributions from all 
diagrams (21.1a)-(21.1d) are collected, is 


Zw, p) = (2%/m) n(u) at 2a, p), (21.15) 
where 
Zw, p) 
E 4ra)? v — (p')— O(p +a- pea) — 9(p')] 
lim O+ +a [q? —p’?—(p+q-—p’)?] +70.sgn (p’ —pr) 


2m6(q) d®q d8p' | 
pt +g—p?+(ptq—py[ (2x)* ’ Che) 


the factor 6(q)— 6(p’) in the numerator of the first term in the integrand replaces 
— sgn (q— pp) with the condition (21.14). 

First, we note that X has an imaginary part. It is separated from (21.16) by 
means of the rule (8.11), and is 


ETR ER (Sra f (ODL — OPIL- p +a- p’) 


3a d3p' 
+ [1 —6(q)] 0(p’) 6 +q — p')} ô [o +H +? —p?—(pt+a— PY) Gar 


(21.17) 





the expression in the braces is transformed using the fact that 62(p) = 6(p). 
The quasi-particle energy spectrum is calculated, according to (14.13), as 


2 2 2 
e(p) = $+ (ua + EO (Sa P); (21.18) 


in Z? we can put e ~ p?/2m with the necessary accuracy. The fact that £ is 
complex means that the excitations are damped (im € = 0). 

The presence of this damping expresses the instability of quasi-particles due 
to the possibility of their actual decay. A quasi-particle may lose part of its 
energy and so give rise to a pair of quasi-particles (particle and hole). Let us 
consider, for example, the first term in the braces in the integrand of (21.17). 
From the properties of the step function, this term is non-zero if 


p>pr, |q+p—p'|>pr, 4 <Pr. 
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These inequalities correspond to a process in which a quasi-particle with initial 
momentum p(p > pp) enters a state p' (p > p’ > pr), and the momentum 
p—p’ is transmitted to a particle within the Fermi sphere (momentum q < pr), 
which is excited to a state with momentum q+p—p’ outside the Fermi sphere; 
such a transition is equivalent to the appearance of two new elementary ex- 
citations, with momenta —q (hole) and q+p—p’. The law of conservation of 
energy in this process is expressed by the delta function in (21.17), in which 
o+ u acts as the initial energy of the quasi-particle e(p): 


e(p) = e(p’) + [e(a +p- p) — ela); 


here it is sufficient to put e(p) = p?/2m in the first approximation. In accordance 
with the significance mentioned, the energy e(p) determined by this equation in 
fact corresponds to a quasi-particle outside the Fermi sphere (e > u). 

Similarly, the second term in the braces in (21.17) results from processes in 
which a pair is generated by a hole. This term gives the damping of elementary 
excitations with £ < u. In the language of the diagram technique, the possi- 
bility of creation of a pair by a quasi-particle is indicated by the possibility of 
dividing the G function diagram into two parts by cutting through three con- 
tinuous lines, one of which is in the opposite direction to the other two. In the 
diagrams (21.1c) and (21.1d), such cuts are to be made between the two broken 
lines. 

The case of a slightly non-ideal gas is special (in comparison with the general 
case of any Fermi liquid) in that the quasi-particle spectrum in it is meaningful 
for all values of the momenta, and not only near the Fermi surface: the decay 
of the quasi-particles (im €) is relatively small, because the “gaseousness 
parameter” ap, is relatively small. Here, however, we shall give the final result 
of the calculations only for two limiting cases. 

Near the Fermi surface (| p—p;| < Ppr), we find 


ree = u+(p—pr) prim", 


with u from (6.14) and m“ from (6.17). For the decay of the quasi-particles, 
we have 


f 1 
im € = 2, (pray (p—pr)* sgn (p—pr). (21.19) 


The proportionality of this expression to (p—p,;)* has an obvious origin: one 
factor p—p, is the width of the region in momentum space (a thin spherical 
Shell) which contains the momentum of the quasi-particle after it has created 
a pair, and the other factor p—p, is the width of the layer in which the pair is 
created. These considerations, it may be noted, apply to any Fermi liquid, so 
that we always have im e oc (p— pp}? near the Fermi surface." 


t At non-zero temperatures, the averaging of this quantity over the thermal distribution: 
makes the decay proportional to T?, as discussed in §1. 
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For large momenta p > pp (but still pa « 1), we have 








PFP 2 
n (pra). (21.20) 


In both cases the ratio im e/re e is small. The maximum value of this ratio 
is reached when p ~ pp, but even then it is ~(p,a)? œ 1. 

Lastly, the value of the renormalization constant for the Green’s function 
of a slightly non-ideal gas, calculated as 


Hoes 1-[ ee) 
Z Ow ERES 
is 
A ET (21.21) 
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CHAPTER III 


SUPERFLUIDITY 


§ 22. Elementary excitations in a quantum Bose liquid 


LET us now consider quantum liquids with an energy spectrum of a completely 
different type, which may be called a Bose spectrum.* 

This spectrum has the property that the elementary excitations (which are 
absent in the ground state of the liquid) can appear and disappear singly. But 
the angular momentum of any quantum-mechanical system (in this case, the 
liquid) can change only by an integer. Hence the elementary excitations appear- 
ing singly must have integral angular momenta, and therefore obey Bose 
statistics. Any quantum liquid consisting of particles with integral spin (such 
as the liquid isotope He*) must certainly have a spectrum of this type. 

For comparison, it may be recalled that in a Fermi liquid, described in terms 
of the spectrum of elementary excitations, which are absent in the ground 
state (see the end of §1), these excitations can only appear and disappear in 
pairs. This is the reason why elementary excitations in that type of spectrum 
can have a half-integral spin. 

In a quantum Bose liquid, elementary excitations with small momenta p 
(wavelength large compared with the distances between atoms) correspond to 
ordinary hydrodynamic sound waves, i.e. are phonons. This means that the 
energy of such quasi-particles is a linear function of their momentum: 


€ = up, (22.1) 


where u is the velocity of sound in the liquid. The latter is given by the usual 
formula u? = OP/0o, and there is no need to specify whether the derivative is 
taken at constant temperature T or at constant entropy S, since S — 0 when 
T + 0.* 

The number of elementary excitations in a Bose liquid tends to zero as 
T — 0, and at low temperatures, when their density is sufficiently small, the 


t The theory of such quantum liquids was worked out by L. D. Landau in 1940-1941, fol- 
lowing P. L. Kapitza’s discovery of the superfluidity of liquid helium. These discoveries formed 
the basis for the whole of the modern physics of quantum liquids. 

+ The concept of phonons has been defined in Part 1,§§71 and 72, for elementary excita- 
tions in solids. It must be emphasized that the momentum of an elementary excitation in a 
microscopically homogeneous system (a liquid) is the actual momentum, and not the quasi- 
momentum as in the periodic field of the crystal lattice in a solid. 
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quasi-particles may be regarded as not interacting with one another, i.e. as 
forming an ideal Bose gas. Hence the statistical-equilibrium distribution of 
elementary excitations in a Bose liquid is given by the Bose distribution formula 
(with zero chemical potential; cf. the last footnote to §1) 


n(p) = [et PT — 1]-1, (22.2) 


With this distribution, and knowing the function e(p) for small p, we can 
calculate the thermodynamic quantities for the liquid at temperatures so close 
to absolute zero that practically all the elementary excitations present in the 
liquid have low energies, i.e. are phonons. The corresponding formulae can be 
written down immediately by using the expressions for the thermodynamic 
quantities in a solid at low temperatures (see Part 1, §64). The only difference 
is that, instead of the three possible directions of polarization of sound waves 
in a solid (one longitudinal and two transverse), in a liquid there is only one 
(longitudinal), and so all the expressions for the thermodynamic quantities are 
to be divided by 3. For example, the free energy of the liquid is 


F = Fy—V.7°T4/90(fiu)’, (22.3) 
where Fo is the free energy at absolute zero. The energy of the liquid is 
E = E,+V.2°T*/30(hu)*, (22.4) 
and the specific heat 
C = V.27°T3/15 iu)’, (22.5) 


proportional to the cube of the temperature. 

The phonon dispersion relation (22.1) is valid only if the wavelength A/p of 
the quasi-particle is large compared with the interatomic distances. As the 
momentum increases, the curve of e(p) of course deviates from the linear form; 
its subsequent form depends on the particular law of interaction of the liquid 
molecules, and therefore cannot be determined in a general form. 

In liquid helium, the dispersion relation of the elementary excitations has the 
form shown in Fig. 2: after an initial linear increase, the function e(p) reaches 
a maximum, then decreases and passes through a minimum at a certain mo- 
mentum value po.' In thermal equilibrium, the majority of the elementary 
excitations in the liquid have energies near the minima of e(p), i.e. in the region 
of small e (near € = 0) and in the region of e(po). These regions are therefore 
particularly important. Near p = po, the function e(p) may be expanded in 
powers of p—po. There is no linear term, and we have as far as the second- 


order terms 
e = A+(p—py}?/2m", (22.6) 


t This form of the spectrum was first suggested by L. D. Landau (1947) from an analysis of 
experimental results regarding the thermodynamic quantities for liquid helium; it was later 
confirmed by neutron scattering experiments. 

A qualitative theory of such spectra was given by R.P. Feynman (1954); see the footnote: 
following (87.5). 
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erp 


where 4 = e(po) and m“ are constants. Quasi-particles of this type are called 
rotons. \t must be emphasized, however, that both phonons and rotons are 
quasi-particles corresponding only to different parts of the same curve, and 
there is a continuous transition from one to the other. 

The empirical values of the energy spectrum parameters for liquid helium 
(extrapolated to zero pressure and density 9 = 0.145 g/cm’) aret 


l 2 3 p/h Mcm 
Fic. 2. 


u = 2.4 X 104 cm/sec, A = 8.7K, 


227 
Polft = 1.9108 cm-3, m* = 0.16m (He). (22.7) 


Since the roton energy always includes the quantity 4, which is large com- 
pared with T at temperatures sufficiently low for a “roton gas” to be considered, 
this gas may be described by the Boltzmann distribution instead of the Bose 
distribution. Accordingly, to calculate the roton part of the thermodynamic 
quantities for liquid helium we start from the formula for the free energy of 
a Boltzmann gas: 


F= -NT 10g S [ ene” dr, dr = d3p/(27h}; 


see Part 1, §41. In this formula, N is to be taken as the number of rotons in the 
liquid, which is itself determined by the condition of thermodynamic equilib- 
rium, i.e. by the condition of minimum free energy. Equating OF/ON to zero, 
we find for the number of rotons 


N, = V fe-T dr, (22.8) 


t The chemical potential of liquid helium at T = Ois u = —7.16°K. 
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which of course corresponds to the Boltzmann distribution with zero chemical 
potential. The corresponding value of the free energy is 


F, =—VT Í e~IT dr. 


The expression (22.6) is to be substituted in these formulae. Since p? >> m*T, 
in integrating with respect to p we can take the factor p? outside the integral 
and replace it with sufficient accuracy by pọ. In integrating the exponential we 
can extend the range of integration from — œ to oo. The result is 

2(m*T)!? pV 


N, = mnp E F, = —TN,. (22.9) 


Hence the roton contributions to the entropy and the specific heat are 


2 
S=N(3 +F) C= N (gtr r) (22.10) 


T T T) 
We see that the temperature dependence of the roton part of the thermodynamic 
quantities is essentially exponential. At sufficiently low temperatures (below 
about 0.8°K for liquid helium), the roton part is therefore less than the phonon 
part, while at high temperatures the position is reversed and the roton contribu- 
tion is greater than that of the phonons. 
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A quantum liquid with an energy spectrum of the type described above 
possesses a remarkable property known as superfluidity: the property of flowing 
through narrow capillaries or slits without exhibiting viscosity. Let us first 
consider a liquid at absolute zero, at which temperature the liquid is in its 
ground state. 

Let us consider a liquid flowing along a capillary at a constant velocity v. 
Because of the friction against the walls of the tube and the friction within the 
liquid itself, the presence of viscosity would have the effect that the kinetic 
energy of the liquid would be dissipated and the flow would gradually become 
Slower. 

It will be more convenient to discuss the flow in a coordinate system moving 
with the liquid. In such a system the liquid helium is at rest, and the walls of 
the capillary move with velocity —v. When viscosity is present, the liquid at 
rest must also begin to move. It is physically evident that the entrainment of 
the liquid by the walls of the tube cannot initiate movement of the liquid as a 
whole. The motion must arise from a gradual excitation of internal motions, 
that is, from the appearance of elementary excitations in the liquid. 

Let us suppose that a single elementary excitation appears in the liquid, with 
momentum p and energy «(p). Then the energy Eo of the liquid (in the coor- 
dinate system in which it was originally at rest) is equal to the energy e of the 
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excitation, and its momentum Po is equal to p. Let us now return to the coor- 
dinate system in which the capillary is at rest. According to the familiar for- 
mulae of mechanics for the transformation of energy and momentum, we 
obtain for the energy E and momentum P of the liquid in this system 


E=Ey+Py.vt+iMe, P =[P)+My, (23.1) 
where M is the mass of the liquid. Substituting £ and p for Eo and Po, we have 
E=e+p.v+3Mv (23.2) 


The term 4Mv? is the original kinetic energy of the flowing liquid; the ex- 
pression e+p.v is the change in energy due to the appearance of the excitation. 
This change must be negative, since the energy of the moving liquid must 
decrease: e+p.v < 0. 

For a given value of p, the quantity on the left-hand side of this inequality 
is a minimum when p and v are antiparallel; thus we must always have e—pv < 


< 0, or 
v > efp. (23.3) 


This inequality must be satisfied for at least some values of the momentum p 
of the elementary excitation. Hence the final condition for the oçcurrence of 
excitations to be possible in the liquid as it moves along the capillary is obtained 
by finding the minimum of e€/p. Geometrically, the ratio ¢/p is the slope of the 
line drawn from the origin (in the pe-plane) to some point on the curve of e(p). 
Its minimum value is clearly given by the point at which the line from the 
origin is a tangent to the curve. If this minimum is not zero, then, for velocities 
of flow below a certain value, excitations cannot appear in the liquid. This means 
that the flow will not become slower, i.e. that the liquid exhibits the phenom- 
enon of superfluidity. 

The condition just derived for the presence of superfluidity is essentially 
equivalent to the requirement that the curve of e(p) should not touch the axis 
of abscissae at the origin (ignoring the unlikely possibility that it touches this 
axis at some other point). Thus any spectrum in which sufficiently small ex- 
citations are phonons will lead to superfluidity. 

Let us now consider the same liquid at a temperature other than absolute 
zero (but close to it). In this case the liquid contains excitations, and is not in 
the ground state. The arguments given above remain valid, since they made no 
direct use of the fact that the liquid was originally in the ground state. The 
motion of the liquid relative to the walls of the tube when the above condition 
is Satisfied still cannot cause any new elementary excitations to appear in it. 
It is, however, necessary to elucidate the effect of excitations already present 
in the liquid. 

To do this, let us imagine that the “gas of quasi-particles” moves as a whole 
with respect to the liquid, with a translational velocity v. The distribution 
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function for the gas moving as a whole is obtained from the distribution func- 
tion n(e) for the gas at rest by replacing the energy e of a particle by e—p.v, 
where p is the momentum of the particle. For an ordinary gas this is a direct 
consequence of Galileo’s relativity principle, and is proved by a simple change 
of coordinates, but in the present case such arguments cannot be applied 
directly, since the quasi-particle gas is moving not in a vacuum but “through 
the liquid”. Nevertheless, the statement remains valid, as can be seen from the 
following argument. 

Let the gas of excitations be moving relative to the liquid with velocity v. 
Let us take a coordinate system in which the gas is at rest as a whole, and 
the liquid is accordingly moving with velocity —v (system K). According to 
the transformation formula (23.1), the energy E of the liquid in the system K 
is related to the energy Eo in a system Ko where the liquid is at rest by 


E = E,—Py.v+3 Mv. 


Let an elementary excitation of energy e(p) in Ko arise in the liquid. Then the 
additional energy of the liquid in K is e—p.v, and this proves the statement." 
Thus the total momentum of the quasi-particle gas per unit volume is 


P= f pn(e—p.v) de. 


Let us assume that the velocity v is small, and expand the integrand in powers 
of p.v. The zero-order term gives zero on integration over the directions of the 
vector p, leaving 


P=- |r GP de 


or, on averaging over the directions of p, 
J dn 
= — —— —— 2 
P 3 r|( de pdt. (23.4) 


First of all, we see that the motion of the quasi-particle gas is accompanied 
by a transfer of mass: the effective mass per unit volume of the gas is deter- 
mined by the proportionality coefficient between the momentum P and the 
velocity v in (23.4). On the other hand, in the flow of a liquid along a capillary 
(say) there is nothing to prevent the quasi-particles from colliding with the 
walls of the tube and exchanging momentum with them. In consequence the 
excitation gas will be slowed down, like any ordinary gas flowing along a 
capillary. 

Thus we have the following fundamental result. At non-zero temperatures, 
part of the mass of the liquid will behave as a normal viscous liquid which 


t For quasi-particles in a Bose liquid, n(e) is the distribution (22.2). It should be noted that 
the superfluidity condition v < e/p is precisely the condition for n(e— p.v) to be positive and 
finite for all energies. 
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“sticks” as it moves along the walls of the vessel; the remaining part of the mass 
will behave as a superfluid without viscosity. Here it is very important that 
there is no friction between these two parts of the mass of the liquid as they 
pass “through one another”, that is, there is no transfer of momentum from 
one part to the other. For the existence of such motion of one part of the mass 
of the liquid relative to the other has been derived by considering the statistical 
equilibrium in a uniformly moving excitation gas. But if any relative motion 
can occur in a state of thermal equilibrium, it is not accompanied by friction. 

It should be emphasized that the treatment of the liquid as a “mixture” 
of normal and superfluid “parts” is simply a form of words convenient for 
the description of the phenomena in a quantum liquid. Like any description 
of quantum effects in classical terms, it is not entirely adequate. It does not at 
all mean that the liquid can actually be separated into two parts. In reality 
we should say that in a quantum Bose liquid there can exist simultaneously 
two motions, each of which has a corresponding “effective mass” such that 
the sum of these two masses is equal to the actual total mass of the liquid. 
One of these motions is “normal”, i.e. has the same properties as that of an 
ordinary viscous liquid; the other is “superfluid”. The two motions occur 
without transfer of momentum from one to the other. 

Thus, in the hydrodynamic sense the density of a Bose liquid can be written 
as a sum ọ = 0,40, of normal and superfluid parts, each corresponding to a 
hydrodynamic velocity v, or v, An important property of superfluid motion 


is that it is a potential flow: 
curl v, = 0. (23.5) 


This property is the macroscopic expression of the fact that the elementary 
excitations with long wavelength (i. e. with small momentum) are sound 
quanta (phonons). Hence the macroscopic hydrodynamics of superfluid mo- 
tion must not allow other than acoustic vibrations,’ as is ensured by the con- 
dition (23.5); the proof of this condition will be considered in §26.* 

When T = 0, the normal part of the density o, = 0; the liquid can have only 
superfluid motion. For non-zero temperatures, o, is given by (23.4): 


1 dn\ , 
On = al (Sa) dt. (23.6) 


To calculate the phonon contribution to 0,, we put in (23.6) e = up: 


D 1 à dn „ 4p? dp 
(on = 5 | GeO a 
0 


t The liquid is assumed infinite. When there is a free surface, surface capillary waves are 
also possible, and lead to a definite temperature dependence of the surface tension; see Prob- 
lem 1. 

t A detailed account of the hydrodynamics of a superfluid is given in FM, Chapter XVI. 
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and obtain, on integration by parts, 


_ 4 C 4x p® dp de 
(On)ph = sl np Ori = 3u fen dr. 


0 


The remaining integral here is iust the energy of the phonon gas per unit 
volume; taking this from (22.4), we have finally 


(On)ph = 4Epr/ 3V 


= 27nPT4/45h3u5, (23.7) 
To calculate the roton contribution to 9, we note that, since rotons can be 
described by a Boltzmann distribution, for them dn/de = —n/T, and from 
(23.6) 
1 p? N, 
aD aoan 2 =n Ee 
Que = ap | Pn er = 


Since p? = p with sufficient accuracy, we find, taking N, from (22.9), 


N, men 
C = FP = PeT E (23.8) 


At very low temperatures, the phonon contribution to g, is large compared 
with the roton contribution. They become comparable at about 0.6°K, and 
at higher temperatures the roton contribution predominates. 

As the temperature increases, an increasing fraction of the mass of the liquid 
becomes normal. At the point where 0, = ọ, the property of superfluidity 
disappears entirely. This is called the A-point of the liquid, and is a phase 
transition point of the second kind.t The quantitative formulae (23.7) and 
(23.8) are, of course, inapplicable near the A-point, where the quasi-particle 
concentration becomes large, so that even the concept of quasi-particles is 
largely meaningless. 

We may also consider the behaviour of the atoms of substances dissolved 
in liquid helium; the concentration of the impurity is assumed to be so small 
that its atoms may be regarded as not interacting with one another (L. D. 
Landau and I. Ya. Pomeranchuk 1948). 

The presence of an extraneous atom in the liquid gives rise to a new branch 
of the energy spectrum corresponding to the motion of this atom through 
the liquid; of course, owing to the strong interaction of the impurity atom with 
the atoms of the liquid, this motion is really a collective effect in which the 
liquid atoms also take part. A resultant conserved momentum p may be as- 
cribed to this motion. Thus quasi-particles of a new type appear in the liquid, 


t Liquid helium is called helium I at temperatures below this point. The A-points form 
a curve in the phase diagram in the PT-plane. This curve intersects the liquid—-vapour equilib- 
rium curve at 2.19°K. 
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whose number is equal to the number of impurity atoms, and whose energy 
Eimp(P) is a definite function of the momentum. In thermal equilibrium, the 
energy of these quasi-particles is concentrated near the lowest minimum of 
the function £mp(P). In practice, we are concerned with the He’ isotope impu- 
rity, and empirical results show that this minimum is at p = 0; near that point, 
the quasi-particle energy is 

Eimp(P) = p?/2mimp, (23.9) 


with the effective mass Mimp equal to 2.8 times the mass of the He? atom. 

Impurity quasi-particles interact with phonons and rotons when they col- 
lide with these, and therefore belong to the normal part of the liquid. Because 
of their low concentration, their thermal distribution is of the Boltzmann type, 
and their contribution to ọ„, determined from (23.6), is 


(Onimo = > ar = SP np, (23.10) 


where Nimp/V is the number of impurity atoms per unit volume. 


PROBLEMS 


PROBLEM 1. Find the limiting temperature dependence of the surface tension coefficient œ 
of liquid helium near absolute zero (K. R. Atkins 1953). 


SOLUTION. The coefficient « is the free energy per unit area of the liquid surface; see Part 1, 
(154.6). It is calculated from Part 1, (64.1), in which the frequencies wg now relate to surface 
vibrations. In the two-dimensional case, the change from summation to integration (over the 
wave vectors of the vibrations) is effected by including a factor d*k/(2x)* or 22k dk/(2n)°. 
Integration by parts gives 

= «+T f log (1 — e-ż®1T) k dk/2x 
_ À k? dw 
= M0 Ge} nT’ 


where @ is the surface tension at T = 0. At sufficiently low temperatures, only vibrations with 
low frequencies (i.e. long wavelengths) are important. Such vibrations are hydrodynamic 
capillary waves, for which w* = ak?/p ~ a k*/o (where o is the density of the liquid). Hence 


h (2 i f w43 dw 


a= Qn- —— | — — ; 
j 4n Xo eħolT— 1 4 


since the integral converges rapidly, the upper limit may be replaced by infinity. The calcu- 
lation of the integra! (see the note in Part 1, §58) gives 


713 72/3 
T Z (1/3) t013) 


~ Farhi ols 
= Oo — 0.13 T2!9 92/3 /fAl3 2/9, 


This applies to liquid He‘ at temperatures so low that the whole mass of the liquid may be 
regarded as superfluid.t 


a= Xo 


t In a Fermi liquid (liquid He’) capillary waves of the type considered (like volume waves 
of ordinary sound) do not exist, since the viscosity increases without limit as T > 0. 
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PROBLEM 2. Find the dispersion relation £mp(p) for impurity particles in a moving super- 
fluid if its form {9 (p) ina liquid at rest is known (J. Bardeen, G. Baym and D. Pines 1967). 


SOLUTION. After the addition to the liquid at rest (T = 0) of an impurity atom with mass 
m and momentum pp, the energy and momentum of the liquid, in the coordinate system in 
which it was originally at rest, are Ey = ¢{2,(p9), Po = Po. In coordinates such that the liquid 
is moving with velocity v, we have from (23.1) 


E= elpo) + Po «V+ (M+m) v, P= p+(M+m vy. 


Hence we see that the changes of energy and momentum of the moving liquid when an impu- 
rity atom is added to it are 


Eimp = Elpo) +Po Vimo, P= Potmv. 
Expressing Emp in terms of p, we find 
Emp(D) = eh- mv)+p.v— 3 mv’. 
For small v, as far as the first-order terms, with a spectrum £§} (p) of the form (23.9), we have 


$ m 
Eimp(P) = me +v.p (1- ) 2 


* 
Mimp 








§ 24. Phonons in a liquid 


When we go from the classical picture of sound waves to the quantum concept 
of phonons, the hydrodynamic quantities (density, velocity of the liquid, etc.) 
are replaced by operators that can be expressed in terms of the phonon anni- 
hilation and creation operators ¢,, ĉȘ . We shall derive such expressions. 

First, we recall that, in the classical description of a sound wave, the density 
of the liquid undergoes small oscillations whose frequencies and wave vectors 
are related by œ = uk. The velocity v of the liquid is a quantity of the same 
order of smallness as the variable part 9’ = 0—@ of the density (where go 
is the equilibrium value of the density). The motion of the liquid in the wave 
is a potential flow, i.e. it can be described by a scalar velocity potential ¢ 
which determines the velocity according to 


v= V$. (24.1) 


The velocity and the density are related by the equation of continuity 09’ /dt = 
= — div (ov) ~ — 00 div v, or 


00'/Ot = — Ad. (24.2) 
The energy of the liquid in the sound wave is given by the integral 
E= f (+ Oov? + u?0"?/200) d3x. (24.3) 


The first term in the integrand is the kinetic energy density, and the second 
the internal energy density, of the liquid; both are quadratic in the small 
quantities v and 0’. 
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The subsequent quantization procedure could be carried out in an exactly 
similar way to that for phonons in solid crystals (see Part 1, §72). We shall 
take a somewhat different route, however, which illustrates some instructive 
points of methodology. Let us first consider the liquid density and velocity 
operators expressed in terms of microscopic variables, the coordinates of the 
particles. 

In the classical theory, the density ọ and the mass flow density j of the liquid 
can be written as sums 


e(r) = 2, maS(ta—"); įr) = È Paô(Ta— 1), 


taken over all the particles, where r, and p, are the position vectors and mo- 
menta of the particles. The integrals of these functions over any volume give 
the total mass and total momentum of the liquid in that volume. When we 
go to quantum theory, these functions are replaced by the corresponding oper- 
ators. The density operator has the same form: 


ôlr) = » MaÔ(Ta — T); (24.4) 
the current density operator is 
Kr) = + 3 {Dað(Ta — r) + ô(Ta — r) Pa}, (24.5) 
where fp, = —ifv, is the momentum operator of the particle.* 


Let us find the commutation rule for the operators j(r) and @(r’) taken at 
points r and r’: for brevity, we may consider just one term in the sums (24.4) 
and (24.5), since the operators corresponding to different particles commute. 
In the expansion of the commutator, the operators of the form 6(r1—r) 716(r1— 
—r’) are transformed as follows: 


(r, —r) V,0(r,—r’) = 6(r,—1)(VO(r—r’)) + òlr, — r) Hr, — r’) Vis 


where in the first term (V6(r—r’)) denotes simply the gradient of the delta 
function; because of the presence of the factor 6(r1—r), we can replace (vi6(r1— 
—r’)) by (vé(r—r’)) in that term. The result is 


Kr) 60") — 60’) je) = — ih6(vor—r’)). (24.6) 
Now, instead of j, we use the liquid velocity operator %, defined by 
j= 3 (6? +0ô). 
t For simplicity, let the system consist of only one particle. Averaging the operator 


G(r) = mô(r,— r) over the state with wave function y(r,) gives f v*r6p())d?x,= m|yp(r)|?, as 
it should. Similarly, averaging the operator j(r) gives the correct expression for the current 


density, 
(A2) {p*(r) Ve(r)— v@) Very}. 
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The commutation rule for the operators 6 and¥is determined by the require- 
ment that the expression (24.6) is obtained for the commutator of 6 and i. 
It is easily verified that for this to be so we must put 


(r) OF’) — G(r’) Hr) = — ih(vôlr —r')), 


using the obvious commutativity of the operators 6(r) and 6(r’). Lastly, put- 
ting (r) = V@(r), we find the commutation rule for the density and velocity 
potential operators: 


b(t) f'e) — ô) P(t) = — ihd(r -— r’); (24.7) 


here we must of course replace 6 by the operator 6’ = 6— 90 of the variable 
part of the density. The rule (24.7) is analogous to that for the particle coordi- 
nate and momentum operators; in this sense, 9’ and ¢ here act as canonically 
conjugate generalized “coordinates” and “momenta”. 

Having used the expressions (24.4) and (24.5) to establish the rule (24.7), 
we can now write the operators @ and @’ in the second-quantization represen- 
tation (i.e. express them in terms of the phonon annihilation and creation 
operators), with the requirement that they satisfy the rule (24.7). To do so, 
we write 


d(r) = y Y; (Aud eft: + Agee e~t") 
k 


with coefficients A, as yet undetermined; the summation is over all values of 
the wave vector that occur for a liquid with large but finite volume V.t The 


operators é, and é satisfy the Bose commutation rules 
exc — Cee = Ow. (24.8) 


For subsequent reference, the non-zero matrix elements of these operators 
are 


(me — 1 Gel m) = Cm | Cie | te — 1) = Vi, (24.9) 


where n, are the occupation numbers of the phonon states. 

We shall later need, however, not the Schrödinger operator $(r) but the 
Heisenberg operator ĝ(t, r). This is obtained from @(r) by simply including 
the factors exp (tiwt) with frequencies w = uk in each term of the sum 


b(t, r) = <P 3 (Anck elk .r—kut) + Atet e~ik -t—kut)); 
k 


t Unlike the y operators of particles, the operator of the real quantity œ is Hermitian and 
contains both phonon creation and phonon annihilation operators. This property (like the 
corresponding property of the field operators in quantum electrodynamics) is due to the non- 
conservation of the number of “particles” in the phonon field. 
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cf. the relevant comment for the y operators at the beginning of §9. The density 
operator 6’(t, r) must be related to A(t, r) by (24.2), and is therefore given by a 
similar sum with factors iA,0ok/u in place of A,. The factors 4, must then be 
determined so as to satisfy the commutation rule (24.7). This gives the follow- 
ing final expressions: 


H(t r) = 2 a) (ex elk .r—ukt) +. at elk. 1—akt)) 
i a \ 2Vook > 


(sae : (24.10) 
Ar t, r) = il> a elk .r—ukt) __ A e—i(k .r—ukt) : 
For, on substituting these expressions on the left of (24.7) and using (24.8), we 
obtain the required delta function: 


e 1 A A A A 
ih D (did — ât) ek e= 
k 


ih ih V dk 
; =r) es SF —r’ 
(On) ind(r — r’). 
It is also easy to see that the Hamiltonian of the liquid, obtained by substi- 
tuting = v@ and @’ in place of v and 0’ in the integral (24.3), has the form 


k 


as it should; its eigenvalues are Zuñk(n, ++), in accordance with the concept 
of phonons having energies € = uhk. 

The expression (24.3) for the energy of a liquid in a sound wave consists of 
the first (after the zero-order) terms in an expansion of the exact expression 


E = | [$Y + ee(g)] dx, 


where e(ọ) is the internal energy of the liquid per unit mass. This integral, 
with v and ọ replaced by the operators ¥ = v@ and 6 = o0+ 6’ with @ and 
6’ from (24.10), acts as the exact Hamiltonian of the liquid: 


A = | [Z9 -o+ Ge(6)] d8x; (24.11) 


the kinetic energy operator is written in the symmetrized form 44. 0¥, so as 
to be Hermitian. Here it is important that ọ and @ are canonically conjugate 
“generalized coordinates and momenta” in terms of which the Hamiltonian 
must be expressed. This is seen from the fact that the commutation rule (24.7) 
satisfied by the operators (24.10) is exact; the smallness of the oscillations is 
nowhere used in deriving it. 

The terms of higher (third, etc.) degree in the expansion of this Hamilto- 
nian represent the anharmonicity of the sound vibrations, or in terms of the 
phonon picture describe the interaction of phonons. They have non-zero 
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matrix elements for transitions with simultaneous change of several phonon 
occupation numbers, and thus act as a perturbation causing various phonon 
scattering and decay processes. The matrix elements of the operators ¢, and 
é have, of course, the previous form (24.9), since (as always in perturbation 
theory) the representation used is one in which the unperturbed Hamiltonian 
is diagonal. The terms of the third and fourth orders are 


2 5'3 
A® = { [z 9.0 (a a) =| d?x, (24.12) 
2 2 
A® = =a (iz £) | ĝ'i d?x. (24.13) 


§ 25. A degenerate almost ideal Bose gas 


The fundamental properties of the Bose-type energy spectrum are clear from 
the model of a slightly non-ideal Bose gas at almost zero temperature. This 
model will be considered in the present section in the same way as in §6 for 
a Fermi gas.' The whole of the discussion in §6 relating to the general charac- 
teristics of models of a degenerate almost ideal gas applies here also. In par- 
ticular, the condition of being onlyslightly non-ideal (the gaseousness parameter 
a(N/V)"* < 1, where a is the scattering length) can again be put in the form of 
the condition (6.1) that the particle momentum be small: pa/i « 1. 

The Hamiltonian of the system of bosons (assumed spinless) interacting in 
pairs differs from (6.6) only by the absence of the spin suffixes: 


; A 1 I n 
A =X Sâ; âp +5 2 (Pi P2 | U | PaPa) Ap ArQp.4p.» (25.1) 


with summation over all the momenta appearing as suffixes. The particle 
annihilation and creation operators now obey the commutation rules 


Gas — Abd, = 1. 


As in §6, we again make the assumption that the momenta are small, and 


replace all the matrix elements in (25.1) by their values for zero momenta; 
then 


2 U, A 4 A 
=F $> Oi dy +P È tind. (25.2) 


t The method given below is due to N. N. Bogolyubov (1947). His application of it to the 
Bose gas was the first consistent microscopic derivation of the energy spectrum of “quantum 
liquids”. 

t We shall see below that, in a degenerate Bose gas, the majority of the particles (outside 
the “condensate”) have momenta p ~ f/(aN/V), for which this inequality is indeed satis- 
fied. 
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The starting-point for the application of perturbation theory to this Hamil- 
tonian is the following remark. In the ground state of an ideal Bose gas, all 
particles are in the condensate, i.e. the, state of zero energy; the occupation 
numbers N =o = No = N, N, = 0 for p # 0 (see Part 1, §62). In an almost 
ideal gas, in the ground state and in weakly excited states, the numbers Np 
are not zero, but they are very small in comparison with the macroscopically 
large number No. The fact that the quantity âf a, = No ~ N is very large in 
comparison with unity means that the expression 


doad — GoGo = 1 


is small compared with ĉo and âf themselves, which may therefore be regarded 
as ordinary numbers (equal to VNo), their non-commutativity being neglected. 

The application of perturbation theory now signifies formally the expansion 
of the fourfold sum in (25.2) in powers of the small quantities 4,, âf (p = 0). 
The zero-order term in the expansion is 


Ag Af Gody = aH. (25.3) 


The first-order terms are zero (since they cannot satisfy the law of conserva- 
tion of momentum). The second-order terms are 


a >. (dpG_p + Gf at, +44; âp). (25.4) 


p #0 


Taking only the second-order terms, we can replace aĝ = No in (25.4) by 
the total number of particles N. In (25.3), the more accurate relation 


apt 2 â âp = 


must be used. The sum of (25.3) and (25.4) is then 
N?+N pa (âpâ—p + âp Âtp + 24p âp), 


and on substitution in (25.2) we get the following expression for the Hamilto- 
nian: 


N 
A= = FWh on Im ata, + toy Uo 2 (âĉbâ—p + Gp ay âT, +24, Oe): (25.5) 


The first term in this expression gives, in the first approximation, the energy 
Eo of the ground state of the gas, andits derivative with respect to N the chem- 
ical potential u at T = 0: 


Ey = N2U,/2V, u = NUJ. (25.6) 


The remaining terms in (25.5) give the correction to Eo and the spectrum of 
weakly excited states of the gas. 


100 Superfluidity 


The integral Uo in (25.5) has still to be expressed in terms of a real physical 
quantity, the scattering length a. In the second-order terms, this can be done 
directly from the formula (6.2): Uo = 4xh?a/m. In the first term, however, 
the more exact formula (6.5) is needed, which takes account of the second 
Born approximation in the scattering amplitude. Here we are considering the 
collision of two particles in the condensate, and accordingly in the sum in 
(6.5) we must put pi = P2 = 0, pi = — P: = p, So that 


4nh?a Anhta 1 
Uo = a ( + V >» 5) A 








Substitution in (25.5) gives for the Hamiltonian 
2nh?a N? 4nh?a 1 


2nh?a N Pra 
m Y Y, âti ay + 205 Gy) + TF ata, — 25.7 


A= 








To determine the energy levels, we must bring the Hamiltonian to diagonal 
form; this is done by a suitable linear transformation of the operators âp 
ât. With new operators b, and bt defined by 


Âp = Upbp t Vpbtp ÂF = ubg +,b_p, 
and the requirement that they satisfy the same commutation relations 
bby — bpb = 0, 5,65 — bb, = ôw 


as the â, and â} (it is easily seen that for this, we must have u, — v = 1), we can 
write the linear transformation as 


byALbt, , _ bt+Lb o 
vi- ’ © =v- ` a 


The quantity L, is to be defined so as to eliminate from the Hamiltonian the 
non-diagonal terms b,b, and by 6+,. A simple calculation gives 


a, = 


1 p? i 
Lp = aT e(P)— 5 — mu > (25.9) 
with the notation 
e(p) = [u®p? +(p?/2m)?}/?, (25.10) 
u = (4nh?aN/mv )2. (25.11) 


The Hamiltonian is then 


A=E+ Y elp) bi bp (25.12) 
px<0 
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where 





2 3,4 

Eo = zmei D, fao- + dhe l. (25.13) 

The form of the Hamiltonian (25.12) and the Bose commutation relations 
for the operators 8,, bt enable us to conclude that by and b, are creation and 
annihilation operators for quasi-particles with energy e(p) which obey Bose sta- 
tistics. The eigenvalues of the diagonal operator btb, represent the numbers n, 
of quasi-particles with momentum p, and formula (25.10) gives the dependence 
of their energy on the momentum. (The quasi-particle occupation numbers are 
again denoted by n, to avoid confusion with the actual gas particle occupa- 
tion numbers N,.) This completely determines the energy spectrum of weakly 
excited states of the gas in question. 

The quantity Eo is the energy of the ground state of the gas. Replacing the 
summation over the discrete values of p (in the volume V) by integration over 
V dp|(2nh} and completing the calculations, we get the expression 


2rh?a N? 128 , [ a®N 
B = 1 | z NVS | (25.14) 
(T. D. Lee and C. N. Yang 1957). The chemical potential of the gas (at T = 0) 
is correspondingly 


= 2B, _ ARAN fy, 32 f eN 
= So _ |! Var 7 |: (25.15) 


These formulae give the first two terms in an expansion in powers of (a3N/V)"”. 
Even the next term, however, could not be obtained by the above method. 
It must contain the volume as V~2, and a quantity of that order depends on 
triple collisions as well as on pair collisions. 

For large momenta (p >> mu) the quasi-particle energy (25.10) tends to 
p?/2m, i.e. to the kinetic energy of an individual gas particle. 

For small momenta (p << mu) we have £ ~ up. It is easy to see that the coeffi- 
cient u is the same as the velocity of sound in the gas, so that this expression 
corresponds to phonons in accordance with the general theorems in §22. 
At T = 0, the free energy is equal to Eo; taking the leading term in the expan- 
sion of the latter, we find the pressure 


P = —0E/0V = 2nh?aN?/mV?. 


The velocity of sound is u= +/(@P/0e), where 0 = mN/V is the gas density; 
it is the same as (25.11). 

In the model of a Bose gas here considered, the scattering length a must 
necessarily be positive (for a repulsive interaction between the particles). This 
is seen formally from the fact that imaginary terms would occur in the above 
formulae for the energy if a< 0. The thermodynamic significance of the 
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condition a > 0 is that it is necessary to satisfy the inequality (OP/OV), < 0 in 
this model of a Bose gas. 

The statistical distribution of elementary excitations (the mean values A, of 
their occupation numbers) at a non-zero temperature is given simply by the 
Bose distribution formula (22.2). The momentum distribution Ñ, of the actual 
gas particles can be calculated by averaging the operator 4) 4,. Using (25.8) 
and the fact that the products 6_,b, and 6*6*, have zero diagonal matrix 


elements, we get 
N, = [A +A, + D/C — LZ). (25.16) 


This expression is, of course, valid only if p # 0. The number of particles with 
zero momentum is 


V 
No = N— p N- f | Moar (25.17) 


In particular, at absolute zero all the n, = 0, and with (25.9) we obtain from 
(25.16) the distribution function in the formt 


my 


No = Datpye(py tp’ am EmA) * 


(25.18) 
when T = 0, the mean values of N, are the same as the exact values, and the 
bar over the letter is therefore omitted. The non-idealness of the Bose gas 
naturally causes the presence of particles with non-zero momentum even at abso- 
lute zero; the integration in (25.17) with N, from (25.18) is elementary, and 
gives 


a ela |. (25.19) 


No=N|1-3 wV 
Lastly, a comment on the spectrum derived here. For small p, the derivative 
@s/dp? > 0, i.e. the curve of e(p) turns upwards from the initial tangent 
€ = up. In such a case (see §34) there is an instability of the spectrum because 
of the possibility of spontaneous disintegration of the quasi-particles (phonons). 
The corresponding level width is, however, small (proportional to p* when p is 
small) and does not affect the expressions derived in the approximations con- 
sidered above. 


§ 26. The wave function of the condensate 


As already mentioned in §23, the appearance or disappearance of super- 
fluidity in liquid helium takes place by a phase transition of the second kind. 
Such a transition always involves some qualitative change in the properties of 


1The maximum number of particles with a given momentum magnitude (~ p?N,) occurs 
for p/i ~ »/(aN/V), where the change takes place from one limiting expression for e(p) to 
the other. This has already been mentioned in the second footnote to §25. 
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the body. At the A-point of liquid helium, this change may be described macro- 
scopically as the appearance or disappearance of the superfluid component of 
the liquid. From the more profound microscopic viewpoint it is a matter of 
certain properties of the momentum distribution of the (actual) liquid particles: 
in a superfluid, a finite fraction of the particles (i.e. a macroscopically large 
number of them) have exactly zero momentum; these particles form the Bose- 
Einstein condensate, or simply the condensate, in momentum space. In an ideal 
Bose gas at T = 0, all its particles are in the condensate (see Part 1, §62); in an 
almost ideal gas, almost all the particles are in the condensate. In the general 
case of a Bose liquid with strong interaction between the particles, the fraction 
of particles that are in the condensate at T = 0 is not close to unity. 

We shall show how the property of Bose-Einstein condensation is formulated 
in terms of y operators. For an ideal Bose gas (a system of non-interacting 
bosons), the Heisenberg y operator is written explicitly as‘ 


1 ` i i p? 
Yt, r)= gr bee peta Seth (26. 1) 


As explained in §25, we may ignore the non-commutativity of the operators 
do and âf, regarding them as classical quantities. In other words, part of the 
y operator (26.1) is an ordinary number, which we denote by £: 


Ê = âJ. (26.2) 


To formulate this property of the y operators in the general case of an arbi- 
trary Bose liquid, we note that, since the condensate contains a macroscopically 
large number of particles, changing this number by 1 does not essentially 
affect the state of the system; we may say that the result of adding (or removing) 
one particle in the condensate is to convert a state of a system of N particles into 
the “same” state of a system of N+1 particles.* In particular, the ground 
state remains the ground state. Let £ and Ê+ denote the part of the y operators 
that changes the number of particles in the condensate by 1; then, by definition, 


E\m,N+1) = 7 |m, N}, 
+ |m, N) = E*!m, N+ 1), 


where the symbols |m, N y and |m, N+-1) denote two “like” states differing 
only as regards the number of particles in the system, and & is a complex 
number. These statements are rigorously valid in the limit N — œ. Hence the 


t Cf. (9.3). We assume the gas particles spinless, and so the spin suffix is omitted. In (26.1) 
we have also used the fact that for an ideal Bose gas at T = O the chemical potential u = 0, 
and so the term — ut/h in the exponents is omitted. 

t The addition or removal of the particle is to be regarded as occurring with infinite slow- 
ness. This prevents excitation of the system by the variable field. 
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definition of £ is to be written 


lend 


lim (m,N|£|m,N+1) = £, 
PI aa “ (26.3) 
wim (m, N+1| £+ |m, N) = &*; 


the limit is taken for a given finite value of the liquid density N/V. 
If the y operators are written as 


P= BW, PH ao Ft wt, (26.4) 


their remaining part (“above the condensate”) converts the state |m, N) into 
states orthogonal to it, i.e. the matrix elements? 


lim (m, N|¥'|m,N+1) = 0, 
N =o 


Jim (m,N+1|2"* |m, N) = 0. (26:3) 


In the limit N - œ, the difference between the states | m, N) and |m, N+ 1Y 
disappears entirely, and in this sense £ becomes the mean value of the operator 
YW for that state. It must be emphasized that the finiteness of the limiting value 
is a characteristic of systems containing a condensate. 

The equations (26.3) complete the “operator” properties of Ê and Êt, 
and they may be regarded as commuting with & and ¥’*. In particular, the 
operators Ê and Ê+ will be replaced by £ and &" (i.e. will behave as classical 
quantities) in any averaging with respect to the ground state. We must emphasize 
again that (because the number of particles in the condensate is macroscopic) 
this approximation involves neglecting only quantities with relative order of 
smallness 1/N.* i 

If the time dependence of the wave functions is determined by the Ham- 
iltonian A’=A—puN, then £ is independent of time: the matrix element 
(m, N| &'|m, N+ 1) is proportional to 


exp [FEN 1)—E(N)—(N+1)u +Nal}, 


and the exponent tends to zero, since (to within a quantity ~ 1/N) E(N+1)— 
—E(N) = u. 

In a homogeneous liquid at rest, £ is independent also of the coordinates 
and is simply (with the appropriate choice of the phase of the complex quan- 
tity) 

£ = VM, (26.6) 


t To avoid misunderstanding, it may be mentioned again that these equations refer only to 


transitions between “like” states. 
+ In particular, to this accuracy we must regard as equal the matrix elements of the opera- 


tors ¥ for transitions between states differing by the same (small) number of particles in the 
system. 
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where no is the number of condensate particles per unit volume of the liquid: 
&* is the operator of the particle number density in the condensate, and the 
mean value of this operator is just no. 

The existence of the condensate brings about a qualitative difference in the 
properties of the density matrix for particles in a Bose liquid in comparison 
with the density matrix in an ordinary liquid. In an arbitrary state of a homo- 
geneous Bose liquid, the density matrix is given by 


Nolti, ra) = (mn, N ($+ (t, r) P(t, 1)! m, NY, (26.7) 


and this function depends only on the difference r = rı—r3; cf. (7.13). Substi- 
tuting here the y operators in the form (26.4) and using the properties (26.3) 
and (26.5), we get 

No(¥1, r3) = no +No'(r,, r3). (26.8) 


The density matrix 0’ “above the condensate” tends to zero aS |r1—Tre| > œ; 
the density matrix ọ tends to the finite limit mo/N. This expresses the existence of 
“long-range order” in a superfluid, which is not present in ordinary liquids; 
in these, we always have ọ > 0 as |r1—re| > œ. It is this symmetry property 
that distinguishes the superfluid and non-superfluid phases (V. L. Ginzburg 
and L. D. Landau 1950). 

The Fourier component of the density matrix determines the momentum 
distribution of the liquid particles by 


N) =N fo)e" dx; (26.9) 
cf. (7.20). Substituting o from (26.8), we obtain 
N(p) = (27)? nô) +N f e'(r) e-t -€ dx. (26.10) 


The delta function term corresponds to the finite probability for the particle 
to have exactly zero momentum. 

If superfluid motion takes place in the liquid, or if it is in non-uniform and 
non-stationary external conditions (which, however, vary considerably only 
over distances large in comparison with interatomic distances), the Bose- 
Einstein condensation again occurs, but we cannot now assert that it will 
occur in the state with p = 0. The quantity £, again defined by (26.3), will now 
be a function of coordinates and time, representing the particle wave function 
in the condensate state. It is normalized by the condition | &'|? = mo, and can 
therefore be expressed as 


E(t, r) = /[n(t, rei 9, (26.11) 


Since there is a macroscopically large number of particles in the condensate 
State, the wave function of this state becomes a classical macroscopic quantity.' 


t Just as the field strength of an electromagnetic wave becomes a classical quantity for 
large photon occupation numbers in every state (cf. RQT, §5). 
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Thus there is a new characteristic of macroscopic states in a superfluid, including 
states of thermodynamic equilibrium. 
The current density calculated from the wave function (26.11) is 


: iñ poe mee 
Jcond 3m av tæ ve) 
h. 
= —n VP 
m Oe 


where m is the mass of a liquid particle. This has the significance of the macro- 
scopic current density of condensate particles, and may be equated to nov,, 
where v, is the macroscopic velocity of that motion. From a comparison of the 


two expressions, we find 
Vs = (f/m) 7®. (26.12) 


Since the motion can occur in a state of thermodynamic equilibrium (charac- 
terized by the quantity £), it is non-dissipative, and (26.12) therefore determines 
the velocity of the superfluid motion. We thus arrive at the property of such 
motion already mentioned in §23: it is a potential flow. The velocity potential 
is equal (apart from a constant factor) to the phase of the condensate wave 
function: 


$ = (f/m) ®. (26.13) 


To avoid misunderstanding, however, we should emphasize that, although 
the condensate velocity is the same as the velocity of the superfluid component 
of the liquid (and although the condensate and the superfluid component appear 
simultaneously at the A-point), the densities mno of the condensate and 9, of 
the superfluid component are not at all the same. The identity of these two 
quantities would be impossible to justify, and its incorrectness is also evident 
from the fact that at absolute zero the whole mass of the liquid is superfluid, 
whereas not all its particles are in the condensate.* 


§ 27. Temperature dependence of the condensate density 


The particle number density in the condensate is greatest at T = 0, and 
decreases with rising temperature. The limiting form of its temperature depend- 
ence as T — 0 can be found by considering the fluctuations of a macroscopic 
quantity, the condensate wave function £ (R. A. Ferrell, N. Menyhard, H. 
Schmidt, F. Schwabl and P. Szépfalusy 1968). 

First, we recall that £ is a classical quantity which corresponds to the oper- 
ator Y in the quantum-mechanical formalism. Hence, to calculate the fluc- 
tuations, we ought in principle to use that operator. However, near absolute 


+ In practice, the density of the condensate in liquid helium seems to be only a small frac- 
tion of the total density of the liquid. 
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zero, long-wavelength oscillations play the main part in the fluctuation spectrum 
of a macroscopic quantity. These oscillations in the liquid are sound waves 
described by the macroscopic equations of hydrodynamics, and it is therefore 
possible to construct an operator corresponding to £ by independent quanti- 
zation of £. 

In the present case, for Æ = yno exp i®, in the long-wavelength limit, the 
phase ® fluctuates most strongly, and is directly related to the superfluid ve- 
locity potential by (26.13). Both ¢ and ð, it may be recalled, are defined only to. 
within additive constants. The uniquely defined quantity s/no can therefore be 
expressed only in terms of the derivatives of ®, and so the Fourier components 
of its fluctuations will contain extra powers of the wave vector k, i.e. will be 
small when k is small. 

The relation of the phase @ to the potential ¢ allows ® to be directly related 
to quantities characterizing the phonon distribution in the liquid. For this pur- 
pose, we regard @, and therefore ®, as second-quantized operators, expressing 
¢ by (24.10) in terms of the phonon creation and annihilation operators: 


mu 


1/2 
B= I (sap) Cereat erm; @7.1) 


the unperturbed liquid density is written as ọ = nm, where n is the particle num- 
ber density, and the suffix 0 is omitted. According to the foregoing discussion, 
this means that the operator of the macroscopic quantity £, i.e. the long-wave 
part of the operator Ý, can be expressed as 


W = /no exp i®, (27.2) 


where no is the condensate particle density. 

We first apply this formula to calculate the momentum distribution of par- 
ticles “above the condensate” in a Bose liquid (for small momenta). In the 
single-particle density matrix o(r1, r2), for large distances |r1—re|, we can use 
the long-wave expression (27.2) for the y operator: 


No(t,, r) = (2+ (rq) P(r) ~ nlet eo ebed, (27.3) 


where the mean value is taken with respect to the state of the liquid at a given 
temperature. Since the fluctuations are small, this expression is to be expanded. 
in powers of Ê, retaining only the first non-vanishing (the quadratic) terms. 
Since Ê+ = Ê, we obtain 


Noftri, ra) = Mo — No (B%(r)) + ng (rp) &(r,)). (27.4) 


The third term tends to zero as |ra—ri| >æ, and gives the required above- 
condensate part of the density matrix; the second term is independent of r in a 
homogeneous liquid, and gives a correction to the condensate density that wilt 
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be calculated below by a somewhat different method. Using (27.1), we can 


write the above-condensate part as 


Ne' (ry, r2) = ae > i {(EF2,) e7- 0 =A + (e ety e —*2)/A} 
P 


= Ta X L (". -+ 7) eP. (rata) 
p 


where 
n, = [er¥/F—1]-}. 
Changing from summation to integration, we have 
, _ "omu mt ip . (T1 —r2) _ ap 5 
Ne (r, r2) mr n f p e (27h) : (27. ) 


This expression applies, of course, only to the contribution from small p 
(ň/p large compared with interatomic distances). The integrand in (27.5) 
immediately gives the particle momentum distribution 





1 
N(p) = = (»+7) ‘ (27.6) 
When T = 0, this becomes 
N(p) = mgmu/2np (27.7) 


(J. Gavoret and P. Noziéres 1964); when T # 0 and up «T, 
N(p) = ngmT/np? (27.8) 


(P. C. Hohenberg and P. C. Martin 1965). 
We can now determine the temperature dependence of the condensate den- 
sity. By definition, 
no(T) = n— f N) d°p/(2nh). (27.9) 


If we substitute (27.6) here directly, the integral diverges because of the zero- 
point vibrations. This is related to the invalidity of (27.6) for large p, and means 
only that we cannot calculate in such a way the value of the condensate density 
at T = 0, which must here be regarded as a given quantity. To find the required 
temperature dependence, we must subtract from no(T) its value at T = 0; 
the integral is then convergent. The result is 


nfT)—nf0) __ mu f m dp 


n0) n | p (27h? 
mT? e xdx mT? 
= On nuh f e—1 — Dnu’ (27.10) 


0 


In the calculation we have neglected the temperature dependence of the to- 
tal density of the liquid; this is legitimate, since the thermal expansion of the 
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liquid (due to the excitation of phonons) is proportional to a higher power of 
the temperature, namely T* (cf. Part 1, §67)." 

Finally, we may make some remarks on the methodologically important 
subject of a two-dimensional Bose liquid. In this case, the temperature-depend- 
ent part of the integral (27.9) diverges logarithmically for small p, where the 
formula for N(p) should have been correct. This means that in the two-dimen- 
sional case the basic assumption is incorrect, namely that there exists a con- 
densate at non-zero temperatures. In the two-dimensional case, the condensate 
can exist only at T = 0.* The position here is analogous to that of two-dimen- 
sional crystals (see Part 1, §137). As with the latter the fluctuations of the atomic 
displacements smooth out the lattice, so the phase fluctuations eliminate the 
condensate. The formal analogy between the two systems is that in both cases 
the energy depends on quantities that can appear in it only as derivatives. 
In the first case these are the atomic displacement vectors, which cannot them- 
selves appear in the energy, because the latter is invariant under displacements 
of the system as a whole. In the second case it is the phase of the condensate 
wave function, which cannot itself appear in the energy, because it is not 
uniquely determined. The dependence of the energy on only the gradients of 
these quantities is the ultimate reason for the divergence of the fluctuations. 

Next, we have seen in Part 1, §138, that the weak (logarithmic) divergence 
of the fluctuations causes in a two-dimensional crystal a slow (power-function) 
decrease of the correlation function in the system. Similarly, in a two-dimen- 
sional Bose system the density matrix (27.3) decreases as |r1—r2| — œ according 
to a power law, and does not tend to a constant limit as in the presence of the 
condensate.’ Such a system thereby differs qualitatively from an ordinary 
liquid, and so, in the two-dimensional case also, there can be a phase transition 
of the second kind between the ordinary liquid with an exponential decrease 
of o(r1, r2) and a liquid with a power-law decrease. 


§ 28. Behaviour of the superfluid density near the /-point 


As already mentioned in §23, with increasing temperature the fraction 9,/0 of 
the superfluid density in a Bose liquid decreases, becoming zero at the A- 
point of the liquid, a phase transition point of the second kind. The temperature 
T, of this point is a function of the pressure P; the equation T = T,(P) defines 
the curve of A-points in the phase diagram in the PT-plane. 

In the general theory of phase transitions of the second kind, the change in 
state of the body is described by the behaviour of the order parameter, which 


t The formulae obtained, which are valid for any Bose liquid, are of course in agreement 
with those of §25 for a slightly non-ideal Bose gas. In the comparison, it must be noted that for 
sucha gas n + n, and the condition for p to be small is p «< mu ~ &(any!?. 

+ These statements relate also to a two-dimensional ideal Bose gas. 

$ See J. W. Kane and L. Kadanoff, Physical Review 155, 80, 1967. 
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characterizes its symmetry properties. For the A-transition of a Bose liquid, the 
condensate wave function 5 acts as such a parameter, and describes, as ex- 
plained in §26, the “long-range order” in the liquid. The fact that © is complex 
means that the order parameter has two components, and the effective Hamil- 
tonian of the system (see Part 1, §147) depends only on | £[?, i.e. is invariant 
under the transformation £ — e'* E for any real «. 

The empirical results concerning the /-transition in liquid helium seem to 
indicate that there is no region in which the Landau theory of phase transitions 
is valid: the condition in Part 1 (146.15) is not satisfied anywhere in the neigh- 
bourhood of the 4-point (i.e. anywhere in the region |T—T,|<T,). Hence, 
to describe the properties of this transition, we must use the fluctuation theory 
of phase transitions of the second kind, which makes it possible to relate the 
temperature dependences of various quantities. 

The temperature dependence of the order parameter (and therefore of the 
condensate density mo) as T — T, is given by the critical index £$ (see Part 1, 
§148): 

| E | = ym œ (T,-TY. (28.1) 


A more interesting question, however, is that of the behaviour of the super- 
fluid density ọ,. To calculate it, let us consider a liquid in which the phase ® 
of the condensate wave function varies slowly in space. This means that there 
is in the liquid a macroscopic superfluid motion with the velocity (26.12) and 
accordingly with kinetic energy (per unit volume of the liquid) 


FOV? = o (h?/2m) (VÐ). (28.2) 


This expression may also be applied to the long-wavelength fluctuations of 
the order parameter. According to the hypothesis of scale invariance, the 
only parameter of length that determines the fluctuation picture near the 
transition point is the correlation radius r, of the fluctuations. This therefore 
determines the order of magnitude of the distances at which the fluctuational 
change of the phase ® is of the order of unity; hence the mean square of the 
fluctuational velocity varies with temperature according to 


v oc I/r? oc (T,-T)”, (28.3) 


where » is the critical index of the correlation radius. On the other hand, since 
it is the long-wavelength fluctuations that govern the singularity of the ther- 
modynamic quantities at the transition point, we may naturally assume that 
near this point the fluctuational kinetic energy (28.2) varies with temperature 
in the same way as the singular part of the thermodynamic potential of the 
liquid, i.e. as (T,—7)?~* (where « is the critical index of the specific heat C): 
Thus we find 


0,v2 oc 0(T,—-T)” oc (T-T), 
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whence 9, œ (T,—T)’-*—™, Lastly, with the relation 3» = 2—« (which follows 
from the hypothesis of scale invariance; see Part 1, §149), we have 


0, © (T,-T)2-93, (28.4) 


This establishes the relation between the temperature dependences of os 
and the specific heat near the 2-point (B. D. Josephson 1966).' 


§ 29. Quantized vortex filaments 


An ordinary liquid enclosed in a cylindrical vessel rotating about its axis is 
carried along by friction against the vessel walls, and is ultimately caused to 
rotate as a whole together with the vessel. In a superfluid, only the normal 
component is brought into rotation; the superfluid component remains at 
rest, in accordance with the fact that this component cannot rotate as a whole, 
since this would make the superfluid motion no longer a potential flow.* 

For sufficiently large rates of rotation, however, such a state becomes ther- 
modynamically unfavourable. The condition of thermodynamic equilibrium is 
that the quantity 

Em = E—-M.Q (29.1) 


is a minimum; this is the energy in a rotating coordinate frame, with E and 
M the energy and angular momentum of the system in a fixed coordinate frame 
(see Part 1, §26). The term —M.Q in (29.1) causes (for sufficiently large 82) the 
state with M.Q > 0 to be thermodynamically more favourable than that 
with M = 0. 

Thus, as the rate of rotation of the vessel increases, superfluid motion must 
eventually occur. The apparent contradiction between this statement and the 
condition for superfluid motion to be a potential flow is removed by assuming 
that the potential flow is lost only at certain lines of singularity in the liquid, 
known as vortex filaments or vortex lines.) The liquid executes a motion about 
these lines which may be called potential rotation, since curl v, = 0 throughout 
the volume outside the lines. 

The vortex filaments in a liquid have a thickness of atomic dimensions, 
and macroscopically they must be regarded as being of infinitesimal thickness. 
Their existence does not contradict the expression (26.12) for the velocity, 
Since the latter assumes that v, varies sufficiently slowly in space, whereas it 


t The indices « and ¢ for liquid helium are very small, and so we have with high accuracy 
b = 1/3,and o, € n  (T,—T)?!®, 

t When the liquid rotates as a whole, the velocity v = @QXr, where Q is the angular ve- 
locity and the position vector r is drawn from some point on the axis. Then curl v=292 = 0. 

$ This assumption was proposed by L. Onsager (1949) and further developed by R. P. 
Feynman (1955). 

l! This statement does not apply, however, to the immediate nieghbourhood of the A-point; 
there, the thickness of a vortex filament is of the order of the correlation radius of the fluctua- 
tions. 
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varies with arbitrarily great rapidity near a vortex filament; see (29.3) below. 
It also does not contradict the proof in §23 that superfluid motion is a potential 
flow, which made use of properties of a Bose liquid energy spectrum, since a 
vortex filament is associated with a particular macroscopically large energy 
(see (29.8) below), and the state of a liquid containing a filament cannot be 
regarded as weakly excited. 

Let us first consider vortex filaments from a purely kinematic standpoint, 
as lines of singularity in the velocity distribution for potential flow of the liq- 
uid. Each vortex filament has a particular value (27x, say) of the velocity 
circulation along a closed contour round the filament: 


$ys .dl = 27x. (29.2) 


This value is independent of the choice of the contour of integration: if Cı 
and C are two contours enclosing the vortex filament, the difference between 
the circulations along them is, by Stokes’s theorem, equal to the flux of the. 
vector curl v, through a surface spanning Cı and C2; since this surface does 
not meet the vortex filament, curl v, = 0 at all points on it, and the integral 
is zero. Hence it follows also that a vortex filament cannot terminate: either 
it is closed or it ends at the boundary of the liquid (or, in an infinite liquid, 
has both ends at infinity), since the existence of a free end of a vortex filament 
would imply that there could be a surface spanning the contour C but nowhere 
meeting the filament, and so the integral on the left of (29.2) would be Zero. 
The condition (29.2) enables us to determine the velocity distribution in a 

liquid moving round a vortex filament. In the simplest case of a straight fila- 
ment in an infinite liquid, the streamlines are circles in planes perpendicular 
to the filament, with centres lying on the filament. The circulation along such 


a curve is 27rv,, so that 
Vs = xjr, (29.3) 


where r is the distance from the filament. We may note that in potential rota- 

tion the velocity decreases away from the axis of rotation (the vortex filament), 

in contrast to rigid rotation, where the velocity increases in proportion to r. 
For a vortex filament of any shape, the velocity distribution is given by 


Vs = 4% f dIXR/R?, (29.4) 


where the integration is along the filament, and R is the radius vector from dl 
to the point where the velocity is observed. At distances from the filament 


t This expression may be written down immediately by analogy with the familiar Biot- 
Savart formula for the magnetic field of line currents. The formal equivalence of the two prob- 
lems is evident from a comparison of the velocity circulation (29.2) with the circulation of the 
magnetic field H round the line current J: 


$H dì = 4nJje. 


One problem is obtained from the other by substituting v, for H and }x for J/c. 
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that are small compared with its radius of curvature, formula (29.4) of course 
reduces approximately to (29.3). 

As already mentioned, formulae (29.2)— (29.4) result simply from the fact 
that the motion of the liquid is a potential flow. The quantum nature of vortex 
filaments in a superfluid is shown by the fact that the constant x can only have 
values in a certain discrete series. Using (26.12) for the velocity v, expressed 
in terms of the phase Ø of the condensate wave function, we find as its circu- 
lation 

bv, .dl = (film) A®, (29.5) 


where 4® is the change of phase on traversing the contour. Since the wave 
function is single-valued, its change of phase on returning to the original point 
must be an integral multiple of 27, and so 


x = nh/m, (29.6) 


where n is an integer. We shall see below that in fact only vortex filaments 
with the lowest possible circulation (n = 1) are thermodynamically stable. 


We shall therefore put 
x = film. (29.7) 


Let us now determine the critical rate of rotation of the cylindrical vessel at 
which a vortex filament first appears. It is evident from symmetry that this 
filament will be along the axis of the vessel. The change in the energy of the 
liquid due to the appearance of the vortex filament in it is 


AE = | F0v2dV = $0,L | v2.2ardr = Lox? | drir, 


where L is the length of the vessel. The integration with respect to r is to be 
taken between the radius R of the vessel and some value r ~ a of the order 
of atomic distances, at which the macroscopic treatment ceases to be meaning- 
ful; because the integral is logarithmically divergent, its value does not depend 
greatly on the precise choice of a. Thus 


AE = Lx,(h?/m) log (R/a); (29.8) 


this expression is said to have logarithmic accuracy, i.e. not only the ratio 
R/a but also its logarithm is large. The angular momentum of the rotating 
liquid is 

M = favs dV = 0% | dV = LxRXh/m) os. (29.9) 


t The motion round the vortex filament is in general accompanied by a change in the den- 
sity of the liquid. The neglect of this change in the calculation given here is justified by the fact 
that the main contribution to the energy (29.8) comes (because of the logarithmic divergence 
of the integral) from large distances r, at which the density change is small. For the same reason, 
we may neglect the contribution to JE from the change in the internal energy of the liquid. 
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The occurrence of the vortex filament is thermodynamically favourable if 
AE,» = JAE-MQ < 0, i.e. if 


R > Qe = (h/mR?) log (R/a). (29.10) 


The above arguments also indicate the reason why vortex filaments with 
n > 1 in (29.6) are thermodynamically unstable: when n = 1 is replaced by a 
value n > 1, the energy AE is increased by a factor of n?, and M by a factor n, 
which must increase AE,,y- 

When the rate of rotation of the cylindrical vessel increases further beyond 
the critical value (29.10), new vortex filaments appear, and when 2 >> Qo 
their number is very large. Their distribution over the cross-section of the 
vessel tends to a uniform one, and in the limit they simulate the rotation of the 
superfluid part of the liquid as a rigid body.’ The number of vortex filaments 
for a given (large) value of Q is easily determined by the condition that the 
velocity circulation along a contour enclosing a large number of filaments 
should have a value corresponding to rotation of the liquid as a whole. If 
such a contour encloses unit area in the plane perpendicular to the axis of 
rotation, then , 

$vs.dl = p.2nx = 2nvh/m, 


where v is the distribution density of the vortex filaments over the cross-section 
of the vessel. On the other hand, when the liquid rotates as a whole, curl 
v, = 2Q, and this circulation is 22. Equating the two expressions, we find 


vy = mnh. (29.11) 


The occurrence of vortex filaments to some extent eliminates the property 
of superfluidity. The elementary excitations that form the normal component 
of the liquid are thẹn scattered by the filaments, transferring to these (and thus 
to the superfluid çomponent of the liquid) a part of their momentum. This 
consequently implies the presence of friction between the two components 
of the liquid. 

Vortex filaments in general move about in space with the flow of the liquid. 
When T = 0 and the liquid is entirely superfluid, each element dl of the fila- 
ment moves with the velocity v, of the liquid at the position of that element. 
At non-zero temperatures, the frictional force on the filament causes it to have 
a velocity relative to the superfluid component. 

Vortex filaments formed by rotation are straight. The flow of a liquid through 
capillaries, slits, etc., may be accompanied by the formation of closed fila- 
ments or 


t This is easily seen by noting that, since the number of filaments increases in proportion 
to Q (see (29.11) below), the second term in AE, oe = AE— MQ increases as 22, but the first 
term increases as 2, and may therefore be neglected when 2 >> 2,,. Then the minimization of 
AE,.. is equivalent to the maximization of M, which occurs when the liquid rotates as a rigid 
body. 
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vortex rings. These eliminate the superfluidity in flow at velocities above a 
certain critical value. The actual values of these critical velocities depend on 
the specific conditions of the flow; they are much less than the value above 
which the condition (23.3) is violated. 

Unlike straight vortex filaments, which can remain stationary in a liquid 
that is at rest (far from them), vortex rings move relative to the liquid. The 
displacement velocity of each line element is the value of v, which results 
(according to (29.4)) at its position from the action of all the rest of the fila- 
ment; for curved filaments this is not in general zero. Consequently, vortex 
rings have as a whole not only definite energies but also definite momenta, 
and in this sense are a special type of elementary excitations. 


PROBLEMS 
PROBLEM 1. Find the velocity and momentum of a circular vortex ring. 


SOLUTION. Each element of the ring moves with the velocity v, at a given point, and from 
the symmetry of a circular ring this velocity is the same at every point of it. It is therefore 
sufficient to determine the velocity v, at any one point P of the ring due to the rest of the ring. 
The elements dl of the ring and the radius vectors R from dl to the point P are in the plane of 
the ring; hence the velocity at the point P, given by (29.4), is perpendicular to the plane of the 
ring, as a result of which the ring moves without change of shape or size. 

Let us define the position of the element dl by the angle # (Fig. 3). Then 


dl = Rd, R = 2R,sin40, |dlxR| = Rsin48.dl, 


f 


Fic. 3. 


where R, is the radius of the ring, and we find from (29.4) for the ring velocity v 





This integral, however, is logarithmically divergent at the lower limit, and must be cut off at 
a value ® ~ a/R, corresponding to atomic distances (~ a) of the element dl from the point P. 
The integral is determined, with logarithmic accuracy, by the range of values a/Ry x 0? «72, 
and is 
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so that 
v = (x/2Ro) log (Ro/a) 
= (A/2mRo) log (Ro/a). (1) 


With the same logarithmic accuracy, the energy of the vortex ring is 
& = 2n?Roo,(h?/m") log (Ro/a), (2) 


which is (29.8) with Ry and 27R, in place of R and L. The energy e is related to the velocity v 
by de/dp = v, where p is the momentum of the ring. Hence 


dp = de/v 
= 4n’ohi/m) RodRo 


{with logarithmic accuracy, the large logarithm is to be regarded as constant in the differentia- 


tion), and so 
p = 27°0,(h/m) RZ. (3) 


Formulae (2) and (3) determine the function e(p) for vortex rings in parametric form (with 
Ry as parameter). 

It may be noted that, because of the logarithmic nature of the integration that leads to 
formula (1), this formula (with some changes of notation) remains valid also for the velocity 
v with which any given element moves in a curved vortex ring of any shape: 


v = (x/2Ro) b log (A/a). (4) 


Here b is a unit vector perpendicular to the tangent plane at the given point on the filament 
(the binormal vector), Ry is the radius of curvature at that point, and 2 is the characteristic 
distance over which the curvature of the filament varies. 


PROBLEM 2. Find the dispersion relation for small vibrations of a straight vortex filament 
(W. Thomson 1880). 


SOLUTION. We take the line of the filament as the z-axis, and let r = (x, y) be a vector 
giving the displacement of points on the filament when it vibrates; r is a function of z and the 
time 7, of the form exp[i(kz— wt)]. The velocity of points on the filament is given by formula (4), 
with 4 here taken as the wavelength of the vibrations (A ~ 1/k): 


: 1 1 b 

v = dr/dt = — iwr = 5 x log ak Ro’ 

The binormal vector b = tXn, where t and n are unit vectors along the tangent and the prin- 
cipal normal to the curve. According to a well-known formula of differential geometry, 
d’r/dl? = n/Ro, where Z is the length measured along the curve. For small vibrations, the fila- 
ment is only slightly curved, and we can therefore take / ~ z and t = t, (a unit vector along 
the z-axis); then 

b/Ry © t,Xd*r/dz* = —k*t,Xr. 


The equation of motion of the filament is then 
— tor = —1xk*t,xr log (1/ak). 


On expansion, this gives two linear homogeneous equations for x and y; equating the deter- 
minant to zero, we get the required relation between w and k: 


w = 4k? log (1/ak). 
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§ 30. A vortex filament in an almost ideal Bose gas 


As already mentioned, the thickness of a vortex filament in a liquid is compa- 
rable with atomic distances. An exception in this respect, however, occurs in 
the case of an almost ideal Bose gas. Here the “core” of the vortex filament, 
where the properties of the medium are significantly altered, has (as we shall 
see below) a macroscopic thickness, and its structure may be macroscopically 
described (V. L. Ginzburg and L. P. Pitaevskii 1958, L. P. Pitaevskii 1961, E. P. 
Gross 1961). 

Let us consider a slightly non-ideal gas at absolute zero. In such a gas, almost 
all the particles are in the condensate state. In terms of y operators, this means 
that the “above-condensate” part of the operator (Ê) is small in comparison 
with its mean value, i.e. in comparison with the condensate wave function =. 
If we neglect this small part completely, Æ will satisfy the same “Schrödinger 
equation” (7.8) as the complete operator Ë. If only pair interactions are taken 
into account, this equation is (for spinless particles) 


in? 501, 2) = a A(t, r) + E(t E(t, vr’) (2U ^ d3x' 
È sq, ) =- (spy +H) BC )+5(t, r) | (S(t, v’)[2 Ur’) dx’. 
(30.1) 


Regarding the function Z(t, r’) as varying only slightly over atomic distances, 
we can replace it by Z(t, r) and take it outside the integral, which then be- 
comes f U(r)d® x = Uo. Substituting also u = nUo (see (25.6); n is the unper- 
turbed value of the particle number density in the gas), we get 


ih OE/Ot = —(?/2m) AZ + UE | E |2— nE}. (30.2 


— 


In a stationary state, & is independent of the time. A straight vortex fila- 
ment corresponds to a solution having the form 


E = /ne'*f(r/ro), ro = fi/a/(2mU pn), (30.3) 


where r and ¢ are the distance from the axis of the filament and the polar 
angle round the axis. The phase of this function corresponds to the value (29.7) 
of the circulation. The squared modulus |&'|? is the particle number density 
in the condensate; in the approximation considered, it is the same as the total 
density of the gas. When r — œ, this density must tend to the fixed value n, 
and therefore f must tend to unity. 
With the dimensionless variable = r/ro, we find for the function f(&) the 

equation 

1 df dA f 

TE ( F) -F4f-f =0. (30.4) 
Figure 4 shows the solution obtained by numerical integration of (30.4). 
When & - 0 it tends to zero as £; when Ẹ +o it tends to unity as 1—1/2é. 
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f£) 


0.5 
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The parameter ro determines the order of magnitude of the filament “core” 
radius. Using the scattering length a instead of Uo, with Uo = 4ath?a/m (6.2), 
we find 

ro ~ N-H3p-112 >> 7-113, 


where 7 = an?’ is the gaseousness parameter. This radius is therefore in fact 
large in comparison with interatomic distances if the gaseousness parameter is 
sufficiently small. 


PROBLEM 


Find the spectrum of elementary excitations in an almost ideal Bose gas, regarding it as the 
dispersion relation for small oscillations of the condensate wave function. 


SOLUTION. We consider small oscillations of £ about a constant mean value yn: 


F = yn+ Aek .r-0 4 Beek rat) 


where A and B* are small complex amplitudes. Substituting this expression in equation (30.2), 
linearizing, and separating terms with different exponential factors, we find a set of two 
equations: 

hwA = (p?/2m) A+nU,(A+B), 


—hwB = (p*/2m) B+nU,(A+ B), 
with p = Ak. Hence, equating the determinant to zero, we have 
(fico)? = ( p?/2m)?+(p?/m) nUo, 


in agreement with (25.10). 


§ 31. Green’s functions in a Bose liquid' 


The mathematical formalism of Green’s functions in a Bose liquid is very 
similar in its structure to that for a Fermi system. Without reiterating all the 
arguments, we shall give here first of all the basic definitions and formulae, 
stressing the differences due either to the different particle statistics or to the 
presence of the condensate.’ As in the preceding sections of this chapter, the 
particles in the liquid are assumed spinless. 


t In §§31-33 and 35, the units used have # = 1. 
+ The mathematical technique of Green’s functions was first applied to Bose systems with: 
condensate by S. T. Belyaev (1958). 
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In determining the Green’s function for a Bose liquid, we must separate the 
condensate part of the Heisenberg y operators by putting them in the form 
(26.4). The Green’s function is found from the above-condensate part of the 
operators according to the formula 


G(X, Xo) = — TEXY PHX), (31.1) 


where the angle brackets (...) again denote averaging with respect to the 
ground state of the system, and T denotes the chronological product. However, 
in contrast to the fermion case, the interchange of y operators to put them in 
the necessary order need not be accompanied by a change of sign of the prod- 
uct, so that, unlike (7.10), 


(OX) (KX), h> | (31.2) 


iG(X; Xa) = H +(X) PXD, l =< lz 


A similar mean value to (31.1) but with the complete y operators instead of 
the above-condensate ones would give 


-iT P(X) 2+ (X,)) = — in + G(X, Xa), (31.3) 


where no is the particle number density in the condensate.’ In a homogeneous 
liquid, the function G depends, of course, only on the difference X = X,— Xz. 

The above-condensate density matrix 0’ is expressed in terms of the Green’s 
function by 


No' (rı, r2) = iG(h, r1; tı +0, r2) = iG(t = —0,r); (31.4) 


it will be noted that the sign is opposite to that of (7.18). In particular, for 
rı = rg we get from this the total above-condensate particle number density, 
Zm = iG(t = —0, r = 0); (31.5) 
cf. (7.19). 

The change to the momentum representation is made by the same formulae 
(7.21), (7.22). The normalization of G(w, p) is expressed by 





N Pp ~ jon AO EP , 
y = roti lim, | G, p)e (27) 3 (31.6) 


cf. (7.24). 


t As with Fermi systems, we shall consider states of a Bose system for a given value of the 
chemical potential u (rather than of the number N). Accordingly, the difference H’ = H— uN 
(7.1) acts as the Hamiltonian of the system. The condensate part of.the y operator is then in- 
dependent of time. 
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For the Green’s function of a Bose system in the momentum representation 
we can derive an expansion similar to the one obtained in §8 for a Fermi sys- 
tem. Exactly analogous calculations lead first to the formula 


_ AmO(p oy Pm) 
Glo, p) = (2x) ra ENED Fee 


B mO(p + Pn) 
ey E E 31.7 

w — EN) +Em(N—1)+u— i0 l, CRD 
where 

Am = |OD (Om|, Bm = | (m |9 (0) 0) /*, 

Ņ'(r) being the Schrödinger above-condensate operator. To bring this expan- 
sion to the final form, we note that the excitation energies £€„(N) in a Bose 
system are determined as the (always positive) differences between the energies 
of the excited states of the system and the energy of its ground state for a con- 
stant particle number N. Since Eo(N)+ u ~ Eo(N+1), we therefore find that 


En(N +1)—Eo(N)—b ~ En( N+ 1)—Eo(N +1) = En(N +1) > 9, 
En(N—1)— E(N) + ~ En(N—1)-E(N—1) = En(N—1) > 0: 
But the addition or removal of one particle changes the properties of the system 
only in the terms of relative order ~1/N; for a macroscopic system these terms 


are negligible, and so the excitation energies €„(N+1) are to be regarded as 
coinciding with each other and with ¢,(N). Thus we have finally 


Amom(P—Pm) _ BmSm(p + Pm) 


O — Em +10 O L Em — iÔ (31.8) 


G(o, p= CE 
By the same method as in deriving (8.14), we easily find from this that for 
Bose systems the imaginary part of the Green’s function is always negative: 


im G(w, p) < 0. (31.9) 

The asymptotic form of the Green’s function for œw — œ remains the same 
as for Fermi systems: 

G(w, p) — 1/o as |w' +0; (31.10) 


cf. (8.15). In deriving this, we must use the commutation rule 
P(t, r) Pl, r) — PHC, ra) M(t ri) = ôe, =r), 


in which the commutator of the operators ¥ and + now replaces the anti- 
commutator.* 


t Formula (31.7) corresponds to (8.7). The factor 4 is absent here, because the particles 
are spinless. It should be noted that the sign of the second term in (31.7) is the opposite of that 
in (8.7). 

+ The fact that the condensate part of the y operators is separated in the definition of G is 
here unimportant: the constant term — ing in (31.3) corresponds in the momentum represen- 
tation to the delta function 6(@) ô(p), which does not affect (31.10). 
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Next, arguments similar to those in §8 lead to the fundamental result that 
the poles of the Green’s function determine the spectrum of elementary exci- 


tations 
G-e, p) = 0, (31.11) 


and only the positive roots of this equation are to be taken; the subtraction of 
u from e is here unnecessary, in contrast to (8.16). 
Near its pole, the Green’s function has the form 


G(o, p) ~ Zallo Fep] Z,>0, Z_ <0; (31.12) 


the sign of the residue at the pole is the same as that of w, as follows from the 
fact that the coefficients 4,, and B,, in (31.8) are positive. The magnitude of the 
residue is subject to no conditions such as (10.4) for example, for Fermi sys- 
tems. Using the expression (31.12), we can easily verify (as in §8) that the in- 
equality (31.9) automatically makes the quasi-particle damping coefficients 
positive, i.e. gives the necessary sign —im e > 0, when the values of € move 
into the complex domain. 

The possible passage of above-condensate particles into the condensate and 
back has the result that, in the mathematical formalism of Green’s functions 
for Bose systems, as well as the function (31.1), the following functions auto- 
matically appear, as we shall see in §33: 


iF(X,, X3) = (N—2|T ËX) ÊX) N), (31.13) 
iF +(X,, X,) = (NIT Ê +HX Y) PHX) N—2) 
=(N+2|T +X) +X) N), (31.14) 


where the matrix element is taken for transitions with change in the total number 
of particles in the system, and | N) denotes the ground state of the system with 
N particles; the last equation in (31.14) is valid to within quantities ~1/N 
(cf. the fourth footnote to §26). The functions F and F* thus defined are called 
anomalous Green’s functions. We shall show that in a homogeneous liquid at 
rest they are equal. 

Like the function G, the functions F and F* for a homogeneous liquid depend 
only on the difference Y = X,—X..' Since interchanging X, and X, changes 
only the order of the operators in the product, which is in any case governed 
by the chronological operator, we have 


F(X) = F(—X). (31.15) 


t The fact that the function F is independent of the sum of times 74+ tz arises because the 
term -uN is included in the definition of the Hamiltonian H’ = H— uN. This excludes from 
the difference of energy eigenvalues of systems with different numbers of particles the term 

E(N+2)—E(N) © 2@E/ON = 2p, 


and correspon dingly excludes the factor exp [— #u(t,;+1,)] from the matrix elements of the 
operator Wi Y3. 
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Hence it follows, of course, that in the momentum representation also F is an 
even function of its argument: 


F(P) = F(—P). (31.16) 


Next, a relation between F and F* results from the following property of the 
Heisenberg y operator of a liquid at rest:* 


P(t, r) = Ž(-t, =r). (31.17) 


Taking, say, tə > tı, we thus have 
iF+(X,, Xa) = (N+2| +(X) W'+(X,)|N) 


= (N [E+ (x) 2'+(%,)|N +2) 
= (N|2(—X,) ¥'(—X,)|N +2) 
= iF(—X, — Xo), 


or Ft(X) = F(— X). Using (31.15), we then obtain the required relation 
F+(X) = F(X). (31.18) 


Expressing F(X)in terms of the matrix elements of the y operators, we can 
derive for F(w, p) an expansion similar to (31.8), and thus determine the poles 
of the function, but we shall not pause to do so here, merely mentioning that 
the poles of F(a, p) coincide with those of G(a, p). 

To conclude this section, let us calculate the Green’s function G of an ideal 
Bose gas. First of all, since in the ground state of such a gas all particles are in 
the condensate, the above-condensate particle annihilation operator ¥ acting 
on the wave function of the ground state gives zero. Hence the function G(r, r) 
is non-zero only for t = tı—t > 0 (when, according to (31.2), the creation 
operator Ê+ acts first). 


t This property may be proved as follows. All non-zero matrix elements of the operators 
â, and â} can be defined as real quantities; see QM, (64.7), (64.8). In this sense the operators 


are real, i.e. dt = a* = Gp: (a denotes the transposed operator; cf. QM, §3.) The Schrödinger 


operator 
(r) = V-12 > âp eiP.-r 
p 


therefore has the property + (r) = #(—r). Hence in turn we have the equation (31.17) for the 
Heisenberg operator 
Y(t, r) = exp (Ht) Ẹ (r) exp (— iH), 


as is easily seen by noting that (for a system without spin interactions) the Hamiltonian Ê is 


real (so that Ht =H ) and unchanged by inversion. We must emphasize, however, that if the 
Hamiltonian is real, there can be no macroscopic superfluid motion in the liquid. For a Bose 
system with condensate, the Hamiltonian depends on a macroscopic parameter, the conden- 
sate wave function £. In a moving liquid, this parameter is complex, and therefore the Hamil- 
tonian also is complex (but, of course, Hermitian). 
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Although the chemical potential u = O for an ideal gas, we shall not assert 
this here, regarding u as a free parameter not specified beforehand; such a 
procedure is necessary with a view to the subsequent application of G in the 
diagram technique for an arbitrary liquid, where u acts as such a parameter. 
Accordingly, the operator r) is written as 


YO) =e Yo exp (i (P. Fatt), (31.19) 


differing from (26.1) by the term iut in the exponents. When this expression is 
substituted in the definition of G®, in accordance with (31.2), we note that on 
averaging (i.e. taking the diagonal matrix element) we can obtain a non-zero 
result only from the products 4,4* and 4*4,. But, since in the ground state of 
the gas the occupation numbers of all particle states with p ~ 0 are zero, we 
have 

(4p 4) =0, (Gar) = 1. 


Now changing in the usual way from summation over p to integration, we get 


ij 2 
GO(t, r) = i| exp | i F +iut+ip. | oe Gn) for t>0, (31.20) 
0 for ¢ <0. 


Hence the Green’s function in the momentum representation is 


; . P? : ; 
(0) =e Lp. 
Ga, p) i | exp ( ia t+iut-+ iat) dt 
ò 
The integration is effected by means of the formula 


oo 


iat — eia 
[ eit dt aLi0’ (31.21) 
ò 
derived by including in the integrand a factor e7% with 2 > 0 and then taking 
the limit as 2 + 0. Finally we have 
p? —1 
Go, p) = fe i io (31.22) 
2m 
For an ideal gas the function F(X) = 0, as is evident from the definition 
(31.13), in which both operators annihilate above-condensate particles. In the 
momentum representation also, therefore, 


F(a, p) = 0. (31.23) 


This equation expresses the fact that particles appear above the condensate 
(at T = 0) only as a result of interaction. 
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PROBLEM 
Find the Green’s function of a phonon field, defined as 
D(X,, X2) = D(X,- Xa) = — KT V (X1) (X), (1) 


where the angle brackets denote averaging with respect to the ground state of the field, and 0’ is 
the density operator from (24.10); the chronological product is expanded by the rule (31.2). 


SOLUTION. When substituting (24.10) in the definition (1) we note that, since in the ground 
state all the phonon state occupation numbers are zero, only the mean values (¢,¢+)=1 are 
other than zero. Then, changing from summation over k to integration, we obtain 


= Ok i(k. r Fukt) d'k 
De D= | are Qn 





where the minus and plus signs in the exponent refer to t£ > O and t < 0 respectively; in the 
integral for £ < 0, we have renamed the variable of integration, k — — k. The integrand (with- 
out the factor e**) is the Fourier component of the function D(t, r) with respect to the coordi- 
nates. Expanding with respect to time also, we find the Green’s function in the momentum rep- 
resentation. 


oo 0} 
Dlo, k) Z E. lj elo -ukt dt+ ae al N 


0 -00 
The integration is carried out by means of (31.21); 


ok 1 1 | as ok? 


AOS 2u | w—uk+i0  w+uk—i0 w*— u?k? +10 ` 


§ 32. The diagram technique for a Bose liquid 


The diagram technique for the calculation of Green’s functions in a Bose 
system can be set up similarly to those for Fermi systems in §§12 and 13. We 
shall again formulate the rules of this technique for systems with a pair inter- 
action between particles, described by the operator 


P(t) =} [EHe r) P(t, ra) U(r ra) H(t, my) Y(t, r) dx, d8xq. (32.1) 


The chief distinctive feature of Bose liquids with condensate is that all the 
Heisenberg y operators must be put in the form Ý = +5, where $ is the 
above-condensate part and £ the condensate wave function, which for a liquid 
at rest is simply the real number +/mo.' After this substitution, the operator 
(32.1) separates into a series of terms containing from four to no operators 
¥Y (together with the corresponding additional number of factors +/no). 

The whole discussion in §12 concerning the change to the interaction rep- 
resentation remains valid, and the subsequent expansion of the expressions 
obtained is carried out by means of Wick’s theorem, except that the inter- 


t We must emphasize that, since this quantity arises from the separation into parts of the 
exact (Heisenberg) y operator, mp is the exact value of the condensate density in the liquid 
(at T = 0). 
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change of y operators in the product being averaged does not now involve a 
change of sign. The difference in the form of the terms into which the operator 
(32.1) separates leads, however, to new elements in the Feynman diagrams. 
These will be described in their final form in the momentum representation. 

At each vertex of the diagram we again have three lines meeting: a broken 
line associated with the factor —iU(Q), with 4-momentum Q = (qo, q), and 
two particle lines, one ingoing and one outgoing. Here we must distinguish 
condensate and above-condensate particles. The continuous lines will now 
correspond to above-condensate particles, and such a line (with 4-momentum 
P = (w, p)) is again associated with a factor iG P). The lines of condensate 
particles will be drawn as wavy lines; these have an assigned 4-momentum 
P = 0 and an associated factor +/no.' Thus four kinds of vertices arise: 


(32.2) 


Vertices with one or two wavy lines are said to be incomplete. At each vertex 
there must be “conservation of 4-momentum”; in vertices (b) and (c), therefore, 
the 4-momentum of the broken line is equal to that of the continuous line, and 
in vertex (d) it is zero. The wavy lines are always external lines of the diagram, 
i.e. are joined to it at only one end, the other end remaining free. 

Each diagram that occurs in the definition of the Green’s function G(P) has 
two continuous external lines with 4-momenta P (ingoing and outgoing), and 
may also have some (even) number of external wavy lines; the total numbers 
of ingoing and outgoing external lines are equal in every diagram (this expresses 
the conservation of the total number of particles, condensate and above- 
condensate, in the system). As for a Fermi system, and for the same reason 
(see §13), only those diagrams are admissible which do not separate into two 
(or more) disconnected parts. Unlike the case of Fermi systems, however, the 
diagrams in iG all have the same sign, i.e. rule 3 in §13 is eliminated. 

Each broken line in the diagram has a complete or incomplete vertex at its 
two ends. These, however, cannot be two vertices of the type (32.2d): having 
no continuous end, such a figure cannot be attached to a Green’s function 
diagram. They also cannot be vertices of the types (32.2d and c) or (32.2d 
and b): when there are three wavy external lines, the conservation of 4-momen- 
tum at the vertices would mean that the 4-momentum of the fourth external 
line would also be zero in such a figure, and we should have a figure with four 
condensate (wavy) external lines. 


t More precisely, a factor £ is to be associated with a wavy line coming to a vertex, and a 
factor =* with one leaving; since £ is real, these factors are actually the same. 


126 Superfluidity 


A considerable number of diagrams in each order of perturbation theory, 
constructed by the above rules, are identically zero, however. This is due to 
the absence of above-condensate particles in the ground state of an ideal Bose 
gas, as is particularly clear if we trace back to the origin of the diagrams in the 
coordinate representation: all contractions of the form (2’*+¥”), in which the 
above-condensate particle annihilation operator is to the right and acts first 
on the ground state, are zero; this leaves only contractions of the form yt 

Diagrams with a closed continuous line are zero: such a line arises from a 
contraction E(t, r) &(t, r)), which is the above-condensate particle density. 
Diagrams containing a continuous line closed by a broken line 


> hein 


CO 


are zero: such a line arises from a contraction +t, r2) Y(t, r1) of two y 
operators within the same interaction operator P(t), in which $+ is to the left 
of P. 

Lastly, all diagrams are zero in which a closed circuit is formed by any 
sequence of continuous and broken lines with all the continuous lines in the 
same direction. Such a circuit can be represented as follows, with the time 
arguments of the y operators shown at the end-points of the lines: 


fi to 
EY 


The arguments at the ends of each broken line are the same.* Those of the 
functions G corresponding to the continuous lines are equal to the differences 
to—t1, t3— t2, ta—ts, t1— t4; for each closed circuit their sum is Zero, so that at 
least one of them is negative and the corresponding function G is zero. 

The above rules relate also to the diagrams which determine the anomalous 
Green’s function, the only difference being that both the continuous external 
lines must be outgoing (for F) or both ingoing (for F*). Accordingly, in these 
diagrams the numbers of ingoing and outgoing wavy lines are no longer equal, 
but the total number of outgoing lines remains equal to the total number of 
ingoing ones. The 4-momentum P is assigned to one of the continuous external 
lines, and —P to the other, where P is the argument of the required function 
F(P) or F*(P);° the sum of the 4-momenta of these two lines must be zero, by 
the “law of conservation of 4-momentum” applied to the whole diagram. 


t For a similar reason, some diagrams were zero for two-particle scattering in vacuum; 
cf. §16. 

+ In the space-time representation of the diagrams, a factor /U(X,— X2), which contains 
the delta function 6(t;— f,), corresponds to a broken line between points 1 and 2. 

$ Since F is an even function of its argument, the choice of sign for P is here unimportant. 
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The Green’s functions calculated by the diagram technique contain two 
parameters: the chemical potential u and the condensate density no; these 
parameters have also to be related to the liquid density n = N/V. One relation 
between these three quantities is given by formula (31.6), which follows im- 
mediately from the definition of the Green’s function. As a second relation, 
we use the equation (33.11) derived below, which expresses p explicitly in 
terms of the concepts of the diagram technique. 


§ 33. Self-energy functions 


Let us examine more closely the structure of the diagrams for Green’s 
functions, using the concept of the self-energy function in the same way as was 
done in §14 for Fermi systems: by considering the set of all diagrams (with 
two continuous external lines) that cannot be cut into two parts by dividing 
just one continuous line. In contrast to §14, however, there are now various 
possibilities as regards the direction of the external lines in the diagrams: as 
well as diagrams with one ingoing and one outgoing line, there are those with 
two ingoing, or two outgoing, lines. Accordingly, there are self-energy parts of 
three kinds: 

-in -iZ20 -i202 


33.1 
sO pO OS oe) 


(in this notation, the two suffixes to 2’ denote respectively the numbers of 
ingoing and outgoing continuous external lines), As well as the continuous 
external lines, the self-energy diagrams in general also have wavy (condensate) 
free ends. These are included in the definition of the self-energy function, which 
is represented here by a circle. We shall see later that the functions Yo2(P) and 
(P) are in fact the same: 

Dox P) = X20(P). (33.2) 


We may also note at this point that, since P and — P occur symmetrically in 
the definition of these functions, they are even functions: 


So P) = Eol —P). (33.3) 


As an illustration, the following are all the non-zero diagrams of the functions 
211 and Xos in the first two orders of perturbation theory: 


Or- - TE ES PPTI n DoR (33.4) 
E a a ee lee 35) 


Let us now establish the equations giving the exact functions G and F in 
terms of the self-energy functions. 
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In terms of perturbation theory, the difference G(P) —G(P) is expressed by 
a sum of an infinite number of chain diagrams 


Pein a ENE O—o-= 
| 
+e eh 


consisting of various numbers of circles joined in all possible ways by forward 
and backward (relative to the two outermost) arrows. Similarly, the exact 
function F (the function F = 0) is represented by a sum of chain diagrams 
in which the two outermost arrows have opposite directions: 


If the end member (circle and arrow) is detached from each chain, as shown by 
the vertical broken line, the set of remaining diagrams with the outermost 
arrows in the same direction will again coincide with the exact function G, 
and the set of those with the outermost arrows in the opposite direction will 
coincide with the exact F. 

We shall introduce the graphical notation for these functions, of thick arrows 
in one or both directions 


x F ppa 
P P -P P -P 


Then the foregoing assertions can be written as graphical equations consisting 
of skeleton diagrams: 


P P P P P -P P 
— E ajea easa © anadili ana O nasan 
(33.7) 
-P po =P -P p P P 


Cf. the analogous equation (14.4). In analytical form, these equations givet 


GP) = [L+421(P) G(P) + Xa0(P) F(P)] GOP), l (33.8) 
F(P) = GO(— P)[21(—P) F(P) + Zo(P) G(P)]. 


t A similar system of equations could be written for G and F*, differing from (33.8) only 
by the interchange of £a» and X49. Since F = Ft, this proves (33.2). 
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Solving these equations for G and F and substituting (31.22) for G, we obtain 
the required formulae 


GP) = $ [o+ Fg ht E(—P)], FP) = -E EaP), (339) 


where 
2 POE ; p? 
D = (2e(P)P~ |Fu(P)—o-10+-5 a] [2 (—P)+0-104 2 
(33.10) 


It must be emphasized that these relations do not depend on the internal 
structure of the self-energy functions, and therefore are not connected with the 
assumption of pair interactions between particles; they are thus valid for any 
Bose liquid. 

The energy of elementary excitations in the liquid, as a function of the 
momentum p, is determined by the poles of G and F as functions of the variable 
w. For small p, these excitations are phonons, and their energy tends to zero 
with p. Hence the function (33.10) must vanish when p = 0 and w = 0. From 
this we find the equation 


[231(0)—u]? = 20). 
As an equation for u, it has two roots, of which we must choose 
u = Ea (0) — D49(0). (33.11) 


For, in the long-wave limit, the y operator is given by (27.2), and its above- 
condensate part $’ = Y—4/no ~ ir/no®, so that X'+ =—Y" and F ~—-G; 
the latter equation is satisfied with the choice (33.11), when the numerators in 
(33.9) (in the limit P — 0) differ only in sign. The equation (33.11) is the second 
relation (see the end of §32), which, together with (31.6), enables us to express 
the parameters p and no in terms of the density n of the liquid. 

The subsequent expansion of (33.10) in series in w and p determines the form 
of the Green’s function for small values of the arguments. Here we must take 
into account that the scalar functions 21; and Xoz are expanded in powers of 
p°, and the expansion of 22, an even function of all its arguments, contains 
only even powers of w also. Putting (33.10) in the form 


D = fo -3 [Za -Zu PI] 


p? 1 ° 
7 pa — +h [u(P) +En(—Py| +Z%(P), 


we can immediately conclude that the first non-vanishing terms in the expan- 
sion have the form D = constant X (w?—u?p?-+ i0), where u is a constant, which 
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is clearly the velocity of sound in the liquid. Noting also that, from (33.11), the 
numerators in (33.9) for œw — 0 and p — 0 differ only in sign, we find 


constant 


C= ~F= PHI 


The contour rule is determined by comparing with (31.8). 

The value of the constant in the numerator can be determined by calculating 
from this Green’s function the particle momentum distribution N(p) (for small 
p) and comparing it with the known distribution (27.7). The integral 


—7; K —iw 
N (p) = Pe f G(w, p) e7”! A 


— OO 


(cf. (7.23)) is calculated by closing the contour of integration by an infinite 
semicircle in the upper half-plane (cf. the note at the end of §7), and accordingly 
is determined by the residue at the pole œ = —up+i0. The result is N(p) = 
= constant/2up, and by comparison with (27.7) the constant is found to be 
nomu?/n. Thus we have finally the following expression for the Green’s functions 
with small w and p: 

G = —F = nomu?/n(w? — u2p? + iQ). (33.12) 


This function coincides (apart from a normalization coefficient) with the 
Green’s function of the phonon field (see $31, Problem)—an entirely reasonable 
result, since for small w and p the elementary excitations in a Bose liquid are 
phonons. 

Lastly, let us show the application of the above formulae to the model(§25) 
of an almost ideal Bose gas with pair interaction between particles. In first- 
order perturbation theory, “11 and Xoz are determined by the first two diagrams 
(33.4) and the first diagram (33.5). Expanding these in analytical form, we find 


Zu = [Uo +U (p), Log = MU (p). 


With the same accuracy, the condensate density no in these formulae may be 
replaced by the total gas density n. As mentioned in §25, the gas particle mo- 
menta in this model may be regarded as small, and accordingly the Fourier 
components U(p) may be replaced by their value Uo at p = 0. Then 


Zu = 2nU, Zis = nUo. (33.13) 
Substitution of these expressions in (33.11) gives u = nUo, in agreement with 


(25.6). Substitution in (33.9) and (33.10) leads to the following formulae for 
the Green’s functions: 


_ wo+p?/2m+nU, 
G(@, p) = wep) i0 ’ 
U (33.14) 
F(%, p) = 7! 


w?—e%(p)+i0 ’ 
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2 \2 2 1/2 
e(p) = (zr) + = nU] : 
From the form of the denominators in these functions, it is clear that (p) is 


the energy of the elementary excitations, in agreement with the result (25.10), 
(25.11) obtained previously by a different method. 





§ 34. Disintegration of quasi-particles 


The finite lifetime (decay) of a quasi-particle in a quantum liquid may be 
due either to collisions with other quasi-particles or to spontaneous disintegra- 
tion into two or more new quasi-particles. As the temperature T — 0, the first 
cause of decay disappears, since the collision probability tends to zero with the 
quasi-particle number density, and the decay is then due only to the disintegra- 
tion of quasi-particles. 

Let us consider the disintegration of a quasi-particle (with momentum p) 
into two. If q is the momentum of one of the resulting quasi-particles, that of 
the other is p—q, and the law of conservation of energy gives the condition 


e(p) = e(q)+e('p—q!). (34.1) 


It can happen that in some range of values of p this equation is not satisfied 
for any q; the quasi-particles in such a range do not decay at all (if, of course, 
disintegration into a larger number of quasi-particles is also impossible). As p 
varies, decay begins at the value p = p, (disintegration threshold) for which 
equation (34.1) first has solutions. 

First of all, it should be noted that at the point p = p, the right-hand side of 
(34.1) has an extremum as a function of q. For let the extremum value of the 
sum e(q)+e(|p—q!) for a given p be E(p); we shall take the particular case 
where this is a minimum. Then, in the equation 


e(p)—E(p) = e(q)+e(|p—4q!)—£(p), 


the right-hand side is non-negative. The equation therefore certainly has no 
roots for values of p such that e(p)—E(p) < 0; a root appears only at the point 
P = Pe for which e(Pe) = E(p.). 

Putting equation (34.1) in the symmetrical form 


e(p) = €(4,) +242), Gi +92 = P, 


we find that the condition for an extremum of its right-hand side may be 
written 0¢/Oq, = O¢/Oqo, or 
vi = Vos (34.2) 


i.e. the two quasi-particles formed at the threshold point have equal velocities. 
Here we may distinguish various cases (L. P. Pitaevskiï 1959). 
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(a) The quasi-particle velocity in the Bose liquid is zero for momentum 
p = po corresponding to the roton minimum on the curve in Fig. 2 (§22). 
Hence, if v; = V2 = 0, the quasi-particle disintegrates at the threshold into two 
rotons with momenta po and energies 4. Accordingly, the energy of the dis- 
integrating quasi-particle is e(p,) = 24 and its momentum p, is related to po 
by the condition p, = Po1+ Poe, i.e. 2po cos 8 = p,, where 26 is the angle be- 
tween the directions of the two rotons. Hence it follows that we must always 


have 
Pe < 2Do- (34.3) 


(b) If the velocities are vı = V2 # 0, and the corresponding momenta qı 
and qx are finite, the disintegration at the threshold yields two quasi-particles 
with collinear (parallel or antiparallel) momenta.' 

(c) If the velocities vı and v2 are non-zero but one of the momenta (qi, say) 
tends to zero near the threshold, the corresponding quasi-particle is a phonon 
and the velocity vı = u. We then have a threshold beyond which the creation 
of a phonon by the quasi-particle becomes possible. At the threshold itself, 
the phonon energy is zero, and the quasi-particle velocity reaches that of sound 
(equal to vy = v2 = u). 

(d) There is one special case, in which there is a disintegration of one phonon 
into two, the threshold being at the start of the spectrum, p = 0. Such a dis- 
integration is, however, possible only for one sign of the curvature of the initial 
(phonon) part of the spectrum: we must have d?e(p)/dp? > 0, i.e. the curve of 
e(p) must turn upwards from the initial tangent € = up. This is easily seen by 
representing this part of the spectrum as 


e(p) ~ up+ap', (34.4) 


which includes both the linear term and the next term in the expansion in powers 
of the small momentum.* The equation of conservation of energy (34.1) then 
gives 

u(p—q—|p—q|) = —a(p?—g°— | p—q|*). 


Near the threshold, the phonon is emitted at a small angle 0 to the direction 
of the initial quasi-particle momentum p; on the left-hand side we have 


Pq 
—g—|p—aq| x -—*_(1—cos 9), 34.5 
p—q-|p—q| ~ na cos 0) (34.5) 


t Because of the isotropy of the liquid, the directions of the quasi-particle momentum p 
and velocity v = 0¢/dp are collinear, but they may be in either the same or opposite directions. 

+ The dispersion relation for acoustic vibrations gives the squared frequency was a func- 
tion of the wave vector. Accordingly, the squared phonon energy e*(p) has a regular expansion 
in powers of the momentum p; the expansion begins with a term in p*, and continues in 
powers of p? because of the isotropy of the liquid. The expansion of e(p) itself therefore 
contains odd powers of p. 


§ 34 Disintegration of Quasi-particles 133 


and on the right-hand side it is sufficient to put |p—q' ~ p—q. Then 
1—cos 0 = 3a(p—gq)?. (34.6) 


Hence it follows that necessarily « > 0. 

We shall see later (§35) that in cases (a) and (b) the function e(p) cannot be 
continued beyond the threshold, which is thus the end of the spectrum. In 
cases (c) and (d) the disintegration of a quasi-particle with emission of a long- 
wavelength phonon causes a slight decay which may be determined by means 
of perturbation theory.* 

Let us calculate the decay of a phonon due to its disintegration into two 
phonons (case (d)). The matrix elements for this process come from the third- 
order terms in the Hamiltonian, given by (24.12). For a transition from the 
initial (i) state with one phonon p to the final (/) state with phonons q; and qə, 
the matrix element of the perturbation operator is 


B 32x) /u 1/2 e du), 
Vi = O(P— 41 — %) FVE (= pave fı +3 do a ; (34.7) 
the suffix 0 of the unperturbed density go is omitted. The factor (pq1ıq2)"? 
should be noted; its smallness (in this disintegration of a long-wavelength 
phonon) ensures the applicability of perturbation theory.* 

The differential disintegration probability per unit time is given by 


27 V2 d3q, d°q, 
= 12 — one RB ES we 
dw = i Fj ô(E;— E;) (Onis ; 


see OM, (43.1). When (34.7) is substituted, a squared delta function appears, 
which is to be interpreted as 


[CP — q: — 42)? = hd 3(P — 91 — Q2). (34.8) 


The remaining delta function is removed by integration over d°qo; putting also 
E; = up, E; = u(qit+q2), we obtain 


a 2 d a On a°q, 


w= kando o Bip | PAP) AP-a |P— 4D TA 


+ Which of these cases can actually occur in practice depends on the specific form of the 
quasi-particle spectrum ¢(p). Empirical results for liquid He‘ indicate the presence (at 
pressures below 15 atm) of a short initial section of the phonon spectrum in which is an 
instability of type (d). The spectrum in liquid helium terminates at a point of type (a). 

+ To calculate the matrix element (34.7), we must take into account that each of the phonon 
operators ĉp and é> can be taken from any of three factors 6’ and ¥; this is the reason for the 
factor 3!. The delta function in (34.7) arises from the integration of the factor exp[i(p—qi— q2)» 
.r/h). Lastly, we have used the fact that the directions of p, qı and q, are almost the same. 

$ The function ô(k) arises from the integral f e*Td?x/(2x)?. If the other similar integral is 
calculated atk = 0 (because one delta function is already present), and the integration is taken. 
over a finite volume V, we get V//(2)°, as is expressed by formula (34.8). 
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with independent integration over dq; and dg, the result must be halved, 
because of the identity of the two phonons. Finally, expressing the argument 
of the delta function in the form (34.5) and carrying out the integration over 
dq, = 2nqi dqid cos 0 (in the range qı = p), we find the total disintegration 


probability 
_ 3 02 d ut)? 
w= 320m0n* fı E . (34.9) 
The phonon decay coefficient y = —im e = +w. In particular, for an almost 


ideal gas, according to (25.11), u?/o ~ 4h?a/m? is independent of the density. 


In this case 
y = 3p*/640nh%o (34.10) 


(S. T. Belyaev 1958). 

For phonon emission by a quasi-particle near a threshold of type (c), the 
form of the perturbation operator is established by considering the change in 
the quasi-particle energy in the sound wave. This change consists of two parts: 


de(p) = So tye 
The first term is due to the change in the density of the liquid, on which the 
quasi-particle energy depends as a parameter. The second term, in which v is 
the liquid velocity in the sound wave, is the change in the energy of the quasi- 
particle because of the macroscopic motion of the liquid; since the wavelength 
of the phonon emitted (near the threshold) is large compared with the wave- 
length of the quasi-particle, we may suppose that the latter is in a uniform flow 
of liquid, and the change in its energy is then determined as shown at the begin- 
ning of §23. The perturbation operator is found from ôe by replacing v = Jọ 
and 9’ by the second-quantized operators (24.10), and p by the quasi-particle 
momentum operator ĵ = —iAV: 


ðe 1 
p A oto 7 ` . 
ao tay A 
in the second term, the product has been symmetrized in order to bring it to 
Hermitian form. The phonon emission probability is then calculated as pre- 
viously for phonon disintegration (see Problem). 


PROBLEM 


Determine the probability of phonon emission by a quasi-particle whose momentum p is 
close to the threshold value p, at which the quasi-particle velocity reaches that of sound. 


SOLUTION. The matrix element of the operator (34.11) is taken for the creation of one 
phonon (with momentum q) and the simultaneous transition of the quasi-particle between 
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States (plane waves) with momenta p and p’. Neat the threshold, the phonon momentum 
q < Pe, and the direction of q is almost the same as that of p.t We then find 


A 
Vy, = — i(2nh)? 0(p — 4, — Ge) par (= 


where 


Pee Q ze] 


u do p ~pe : 
Hence the differential phonon emission probability is 


ae 


— TAU 42 E E E : 
dw = fio A*d[e(p)— eli p-a!) ug] Gah ` 


the delta function of momenta is removed by the integration over d?p'. Writing the argument 
of the delta function in the approximate form — ug({1— cos 9) and integrating over d°g, we 


obtain 
w = 2A*(p— pò Bagh. 


§ 35. Properties of the spectrum near its termination point 


In this section we shall consider the properties of the spectrum of a Bose 
liquid near the decay thresholds (thresholds of disintegration) of elementary 
excitations into two quasi-particles, neither of which is a phonon (cases (a) 
and (b) in §34).* In contrast to phonon-creating disintegrations, these cases do 
not allow the application of perturbation theory, and in order to investigate 
them it is necessary to elucidate the nature of the singularities of the Green’s 
functions of the liquid at the threshold points. On the other hand, since we are 
interested only in these singularities, we can to a large extent schematize and 
thus simplify the calculations. In particular, we need not distinguish between 
the functions G and F (since their analytical properties are the same), and can 
proceed as if there were only one type of Green’s function; taking account of 
the difference between G and F would simply produce some terms with analo- 
gous analytical properties in the equations, which would not affect the results. 

The fact that the relevant singularity of the Green’s function is related to the 
disintegration (decay) of a quasi-particle into two others means, in terms of the 
diagram technique, that it arises from diagrams of the type 


Q 
4 _ D- (35.1) 
P-0 


t We are taking the particular case where the phonon is emitted in that direction (and not 
the opposite one). For this to be so, (p) near the threshold must have the form 


e(p) © &(p.)+(p—p,) u+a(p— p,)*, 


with a plus sign in the linear term. From the law of conservation of energy, we easily see that 
phonon emission is then possible if « > 0, and occurs when p > p,; the momentum of the 
emitted phonon takes values in the range 0 = q = 2(p—p,). 

+ The results in this section are due to L. P. Pitaevskil (1959). 
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which may be cut across two continuous lines, i.e. which contain two-particle 
intermediate states. In these diagrams, there is integration with respect to the 
intermediate 4-momentum Q = (go, q), and a decisive role (as regards the oc- 
currence of the singularity) is played by the range of values of Q and P-Q 
with which quasi-particle decay products are formed near the threshold. The 
basic proposition in the theory given below is that this range of 4-momentum 
values is not special as regards the Green’s function G(Q), which has there the 


usual pole form 
G(Q) = Go, 4) < [go — elg) + 10]-*, (35.2) 


where the function ¢(q) is the energy of the quasi-particles formed and has no 
singularity. The only physically distinctive feature of this range is that within 
it the quasi-particle can “stick” to another quasi-particle, but this process is 
impossible at zero temperature, because of the absence of real excitations. The 
only special region as regards the Green’s function is the range of P values 
(external lines in the diagrams (35.1)) near the decay threshold of the original 
quasi-particle. 

The two joining linesin the diagram (35.1)correspond to factors G(Q)G(P—Q), 
and there is integration with respect to Q. Here, since only a small range of Q 
values is important, the remaining factors in the diagram may be taken as 
constant in the integration and equal to their values at the threshold Q = Q..* 
Thus the diagram includes a factor expressed by the integral 


i d'Q 
(272)t J [go— eg) +i0][@—go—a(|p—q|) +20] * 
where P = (w, p). The integration with respect to go is carried out by closing 


the contour of integration with an infinite semicircle in one half-plane of the 
complex variable go, and gives 


II(P) = 


d°q 


1 
as Gay | w—@(q)—e(|p—q]) +10 ` er) 


We shall return later to the study of this integral; first, we must express in 
terms of it the required exact function G(P), summing for this purpose all dia- 
grains of the form (35.1). 

For the function G(P) we can write a Dyson diagram equation: 


Q 
ar ee =D (35.4) 
P-Q 


t This statement needs to be made more precise. Since the factors G(Q)G(P— Q) are inde- 
pendent of the angle ġ which defines the position of the (p, q) plane. Hence the integration 
with respect to @ amounts to averaging the rest of the integrand with respect to œ$, and then 
d'Q can be taken as 2q*dqodq d cos 9. In this integration over d‘Q, only a small range is 
important. This remark applies also to corresponding stages in the following calculations. 
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Here the thick lines represent the exact function iG, and the thin lines the 
“non-singular” part of this function, determined by the set of diagrams that 
cannot be divided across two lines. The second term on the right of (35.4) 
represents the set of diagrams of the form (35.1). The white circle stands for 
the exact “three-point” vertex function, which we denote by I‘(Q, P—Q, P); 
the shaded circle is its non-singular part, which excludes diagrams that can be 
divided across two continuous lines.’ As explained previously, the integration 
over dQ leads to the presence of a factor [I(P), and the remaining factors in the 
diagram are replaced by their values at Q = Q.. Thus equation (35.4) signifies 


that 
G(P) = a(P)+b(P) G(P)I (P) IT(P), (35.5) 


where I'P) = '(Q,, P—OQ,, P), and a(P), b(P) are some functions regular near 
the threshold P = P,. 

In (35.5) there are two singular functions G and I’, and to express them in 
terms of IZ a further equation is therefore needed. This is found by noting that 
the exact vertex function I is represented by a “ladder” series: 


po Dem eo Jo- 


similar to (17.3) for a four-point vertex function. Summation of this gives the 


equation 
Q Q ò 
Q 
D-E = “ek + Yar pF 
P 
P-Q P-Q pag Or 


(cf. (17.4)); in analytical form, with Q ~ Q, it gives 
IAP) = c(P)+4(P)II(P)I AP), 


where c(P) and d(P) are regular functions. Now eliminating J", from the two 
equations obtained, we find the required expression for the Green’s function 
in terms of IT: 

A(P) IT(P) 


OP) = TE BP) ITY 


+C(P), (35.6) 
where A, B and C are functions that are likewise regular (near P = P.). 

The subsequent calculations are different for the various types of quasi- 
particle disintegration. 


t The situation here is analogous to Dyson’s equation in quantum electrodynamics (see 
RQT,§104): as there, the whole required set of diagrams is obtained by applying corrections 
to only one of the vertex functions. 
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(a) Threshold for disintegration into two rotons 


In this case, the energy e(q) of the particles formed near the threshold is given 
by (22.6), and the integral (35.3) becomes 


I(w, q) = f fo- 24- lara +(p—a!— a D 





For the integration, we use new variables q), q,, according to the definitions 


qx = (posin 6 +q) cos ¢, 

qy = (posin 6 +) sin ¢, 

q: = pocos 0 +q: 
the z-axis being in the direction of p, and the angle 9 defined by the equation 
2po cos 0 = p. Near the threshold, q; and q, are small, and we have with the 


necessary accuracy 
q = Pot+q,sin6+4q; cos 8, 


'P—q; ~ po +gesin @—q; cos 0, 
dq ~ posin 0 dq; dq; dd. 
The expression in the braces in (35.7) becomes 


1 


m* 





fo — 24 — (qe? sin? 6+ qz? cos? J 


and after a further change of variables 
qasin 0 = y/m*ocosy, gq,cos@ = y/m* osiny 
we find, integrating with respect to y, 


—___ "Po o do 
MED a] —w+24+0?° 


The divergence of this integral for large p is due only to the approximations 
made, and is not important; cutting off the integral at some value œ > |24—w| 
gives a contribution only to the regular part of II. The singular part of this 
function, with which we are concerned, arises from the range near the lower 
limit of integration, and is found to be 


1 
H œ log 2% . (35.8) 


For small values of 24—«, this logarithm is large; substituting (35.8) in 
(35.6) and expanding in inverse powers of the logarithm, we obtain 


~1 
Gw, p) = b+e( 108375) ; 
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where a, b and c are further regular functions of œ and p. At the threshold 
(p = p,), the energy of the disintegrating quasi-particle is 24. Since the energy 
of quasi-particles is determined by the zeros of G7}, this means that G~(24, P.) 
= 0, and for this we must have b(24, p.) = 0. The regular function b(w, p} 
is expanded in integral powers of the differences p— p, and w—24; replacing 
also the regular functions a(w, p) and c(w, p) by their values at the threshold, 
we arrive at the following expression for the Green’s function in the region 
near the threshold: 


G-Xo, p) = f |2- Dota (Ios a) | (35.9) 


where a, « and # are constants. 

Equating this expression to zero, we find the spectrum e(p) near the threshold. 
If the range in which disintegration is impossible lies at p < p,, € < 24, the 
constants x and a must be positive, and the equation G~1 = 0 has the non- 
decaying solution 





a 
€ = 24 —a exp | — ; 35.10 

p ( Pe ~p ) ( ) 
We see that the spectrum reaches the threshold with a horizontal tangent of 
infinite order. In the range p > p,, however, the equation G71 = 0 has no 
solutions, real or complex, with £ ~ 24 for p ~ p, In this sense the spectrum 
does not continue beyond the threshold, but terminates there.’ 


(b) Threshold for disintegration into two quasi-particles with parallel momenta. 


Since at the threshold, with p = p, the expression «(¢)+ e(|p—q!) as a 
function of q must have a minimum, its form near the threshold is 


e(q) + e(|p—q!) = e+ UA p—pe)+%(q—Go)?+A[(q—Go)-Pc]”, (35.11) 


where « and # are constants, v, is the velocity of each of the quasi-particles 
formed by disintegration at the threshold, and qo is the momentum of one of 
them. Substituting (35.11) in (35.3) and using new variables of integration 
defined by 

P = 4—-Qo, P-Pc = OPc COSY, 


we obtain 
= o? do d cos y 
H (w, p) z aay | E— Ec— Ve P— Pc) — %07— Bop? cos? y A 


This integral has a square-root singularity at the threshold: 
IT œ [vel p—pe)—(€— &c)}¥?. (35.12) 


+ As already mentioned in the third footnote to §34, the spectrum in liquid helium in fact 
ends at a point of this type; the curve in Fig. 2 approximates to the straight line € = 24, with 
a horizontal] tangent. 
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Substituting this in (35.6), we find the Green’s function near the threshold: 
G~*(, p) = A(@, p)+ B(w, p)[ve(p—pe)—(w — &¢) #2. 


Since G~1(e,, Pe) = 0, and A and B are regular functions, we can expand the 
latter in powers of p—p, and w—<¢,, and finally obtain 


G-1! œ [v( p—Pe)—(@— &_) 2+ [a( p—p-)+b(w— «)], (35.13) 


where a and b are constants. 

The form of the spectrum is determined by the equation G~(e, p) = 0. 
We seek its solution in the form e—e, = v,(p—p,)+constant x (p—p,)*; if this 
exists for p < p,, we must have a+bv, > 0, and then 


E= Et Uc P —Pc)— (a+ bv,)?(p—pe)?. (35.14) 


With the same condition at p > p,, the equation G~! = 0 has no solution with 
£ = £, for p~p,. Thus in this case also the spectrum terminates at the thresh- 
old. 


CHAPTER IV 


GREEN’S FUNCTIONS AT NON-ZERO 
TEMPERATURES 


§ 36. Green’s functions at non-zero temperatures‘ 


THE definition of the Green’s function of a macroscopic system at non-zero 
temperatures differs from that at zero temperature only in that the averaging 
with respect to the ground state of a closed system is replaced by an averaging 
over the Gibbs distribution: the symbol (...) now denotes 


Q-E’ 


Ca) = Duda!) wn = exp ( : (36.1) 





where the summation is over all states of the system (distinguished both by 
the energy E, and by the particle number N,), E, = E,—UN,, and (n|...|n) is 
the diagonal matrix element with respect to the mth state. The mean values thus 
defined are functions of the thermodynamic variables T, u and V. 

In the study of the analytical properties of Green’s functions at non-zero 
temperatures (L. D. Landau 1958) it is convenient to use what are called re- 
tarded and advanced Green’s functions, whose analytical properties are sim- 
pler.* We shall take the particular case of Fermi systems. 

The retarded Green’s function is defined by 


(PAX UF (Xa) + EF (Xo) VK), ta > Ey 


0, ty < to. 


iG (Xn, X2) = | (36.2) 


For a microscopically homogeneous non-ferromagnetic system, in the absence 
of an external field, this function (like the ordinary G,,) feduces to a scalar 
function depending only on the difference X = ¥,— X2: 


GR(X1, X2) = b6G(X), GP = $ GR. (36.3) 
The change to the momentum representation is made in the usual way. 


But, since G*(t, r) = O for t < 0, in the definition 


GR(w, p) = f f elet“P- GRC, r) dt dBx (36.4) 
0 


t In §§36-38, the units used are such that # = 1. 
+ These functions are customarily denoted by indices R and A respectively. 
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the integration with respect to fis actually taken only from 0 to œ. The dis- 
placement of the variable w into the upper half-plane simply improves the 
convergence of such an integral. Hence the integral (36.4) defines an analytic 
function without singularities in the upper half-plane of w.t In the lower half- 
plane, where the function G? is defined by analytical continuation, it has poles 
(see below). 

We can obtain for G? anexpansion similar to (8.7) for G at T = 0. Expand- 
ing the matrix element (1! ...' n) of the product of y operators by the matrix 
multiplication rule and expressing the matrix elements in the form (8.4), we 
have 

IGR(t, 1) = $ Y, wale Omt kina (in| a0) | mm PEO) n 
n,m 


+. eil Omni —Kmn . Xn ' yz (0) l mym ' Pa(0) | n)\, 


where 0, = E,,—E,, Km, = P,,—P,. The summation over n and m has 
slightly different meanings for the two terms in the braces: in the first term, 
the numbers of particles in the states n and m are related by N,, = N,-+-1, 
and in the second term by N, = N,,—1. In order to eliminate this difference, 
we interchange the suffixes m and 7 in the second sum. Noting also that 


(n | P(O)! my (mn | Pz (0) ny = |! Pa(O)! m)? = Anny 
we can write the whole expression as 


iGR(t, r) = $} Y wpe mat —Knn DA ma(L + e7enn/T), t>0. (36.5) 

Lastly, in calculating the integral (36.4) we replace œw (as in §8) by w+i0, 
obtaining finally 

GR(%, p) = 1 (2x} y Wn AmnÒP — Emn) 


—Omp/T 
ip bene), (36.6) 


It should be noted that all the poles of this expression lie (in accordance with 
the above analysis) below the real axis, in the lower half-plane of w. 

The latter property is sufficient to establish a certain relation between the 
real and imaginary parts of the function, called the Kramers-Kronig relation 
or the dispersion relation: 


re GR(a, p) = Lp f u; (36.7) 


— 00 


compare the similar relation for x(w) in Part 1, §123. The validity of this can 
also be verified directly, by separating the real and imaginary parts in (36.6) 


t Compare the analogous discussion for the function «(w) in Part 1,§123. The correspond- 
ence of the analytical properties of the functions GË and a@ is, of course, not due to chance. 
From Part 1, (126.8), « is expressed similarly in terms of a certain operator commutator. 
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through the use of (8.11). It may also be noted that the latter formula allows 
(36.7) to be rewritten as 


1 C ou, 
GF(o, p) = — | 2 P) du, (36.8) 


where 
o(u, p) = — 474 È WA mnO(U— wmn) Op — Kn) + e7 Ome’? ), 


m,n 
For real w, ọ = im G°. 

The representation (36.8) acquires a deeper significance if we take the 
“macroscopic limit” V — æ (for a given ratio N/V). In this limit, the poles 
mn coalesce, and e(u) becomes non-zero for all u, and is not just a sum of 
delta functions at discrete points. Then formula (36.8) determines G*(w) di- 
rectly in the upper half-plane of œw and on the real axis. To determine G*(«) 
in the lower half-plane of œw, it is necessary to make an analytical continuation 
of the integral, and this requires the contour of integration to be deformed in 
such a way that it always passes below the point u = w. Here G®(w) may have 
singularities in the lower half-plane (at a finite distance from the real axis), 
and the contour of integration is then “pinched” between the pole u = w 
and the singularity of the numerator. 

The advanced Green’s function is defined similarly by 


0 ti > fe, 


nA = ° 
iG Xi, X2) E Dit (Xo) + Pt (Xo) V(X), ti < ta 


(36.9) 
The function G4(a, p) in the momentum representation is an analytical function 
of the variable w, without singularities in the lower half-plane. Its expansion 
differs from (36.6) by a change of sign of i0 in the denominators. This means 
that on the real axis G4(w) = G¥"(w), and throughout the w-plane 


GA(o*) = GF*(o). (36.10) 
As w — œ, GË and G4 tend to zero in the same way as G: 


GR, G4 — l1/jw as |w| +o. (36.11) 


The coefficient unity in this asymptotic expression is determined (see the deri- 
vation of (8.15)) by the discontinuity of the function at t = tı, which is inde- 
pendent of temperature and is the same for all three functions G*, G^, G, as is 
clear from their definitions. 

To establish the relation between the functions G? and G4 thus defined and 
the ordinary Green’s function 


iGap(X1, X2) = (T PLAX) DX), (36.12) 
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we obtain for the latter an expansion analogous to (36.5). Calculations exactly 
similar to those above give the result 


G(o, p) = 5 (ony 2 WnAmnO(Pp—Kmn) X 


ao (1+e-°mv/T) + i76(@—Wmn) (1 -e-omT)] . (36.13) 
Comparison of (36.13) and (36.6) shows that 


GR(w, p) 
G4(%, p) 


The same expression (36.13) also shows that 


|- re G(œ, p)+i coth («/2T) im Go, p). (36.14) 


sgn im G(w, p) = —sgn w. (36.15) 


It should be noted that G, unlike G? and G4, is not an analytic function of œ. 
As T — 0, coth (m/2T) — sgn w, it follows from (36.14) that on the real 


axis 
GÈ, w>0 
Gs i A (36.16) 
lee w < a 


Thus the function G(w) for T = 0 is equal, on the two halves of the real axis 
of w, to the limits (as |im w| — 0) of two different analytic functions: GF (w) 
on the right half and G4(«) on the left half. 

It is easy to write down expressions for G? and G^ in an ideal Fermi gas. 
We need only observe that they satisfy the same equation (9.6), the derivation 
of which used only the magnitude of the discontinuity of the function at tı = t2. 
The method of passing round the pole is known from the fact that the pole 
must be below the real axis for GO and above it for G4. Hence 

2 


-1 
GOR, 4(%w, p) = anon" io] : (36.17) 


which is valid at both zero and non-zero temperatures. For the function G, 
we find, according to (36.14), 


1 w pP? 
GO EN: OLENE ENA Lalo Aakal ; 
(o P) = P ar ir tanh y ô (o f- +a) (36.18) 


As T + 0, we return to formula (9.7), which differs from (36.17) in that +10 is 
replaced by i0. sgn w. 


t In changing to the momentum representation, the integral with respect to f is divided 
into two parts, from — oo to 0 and from 0 to œ; in one of these, the summation suffixes m and 
n are interchanged. 
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The corresponding formulae for a Bose system are as follows. The retarded 
and advanced Green’s functions are defined by 
CPX) PHX- PHX) P(X), t > to, 
0, fı < t23 (36.19) 


0 ty > to, 


(PXA Lt (Xo) — P+ (Xo) O(N), t< te. 


iGR(X,, X2) = | 
iG4(X1, X2) = | 


For temperatures above the A-point, these definitions involve complete y oper- 
ators; below the /-point, they relate to above-condensate operators. Instead 
of (36.6) we now have 


GR(co, p) = (27) Y, w, Ame kmn) 


— e~ mn/T 
iy ey ememT). (36.20) 


This function is related to G by 
GR(w, p) = re G(w, p)+i tanh (@/2T).im G(a, p). (36.21) 
On the real axis, 
im G(o, p) < 0; (36.22) 


G is defined, according to (31.1), with averaging over the Gibbs distribution 
instead of averaging with respect to the ground state. For an ideal Bose gas, 
the function G” is given by the same formula (36.17), and G is 


1 , w p? 
(0) SP TeS pa a : 
Gw, p) = P 7 iz coth 7T ò (o 3 + n) . (36.23) 


The physical significance of the Green’s functions at non-zero temperatures 
is essentially the same as at T = 0. The formulae relating G to the particle 
momentum distribution (7.23) and to the density matrix (7.18), (31.4), remain 
valid, of course. 

The basic propositions which assert the coincidence of the poles of the 
Green’s function with the energy of the elementary excitations also remain 
valid (but, since G itself is not analytic, it is here more convenient to refer to 
the poles of the analytic function G? in the lower half-plane of w, or to those 
of G^ in the upper half-plane). This statement again (as in §8) follows from the 
expansion (36.6). Although different terms of this expansion now contain the 
transition frequencies ,,, between any two states of the system, there still 
remain (after taking the macroscopic limit) poles corresponding only to transi- 
tions from the ground state to states with one elementary excitation. Transitions 
between two excited states do not produce a pole in the macroscopic one- 
particle Green’s function, for the same reason that no pole results from tran- 
sitions from the ground state to states with more than one quasi-particle (see§8): 
the energy difference of such states is not uniquely determined by their momen- 
tum difference. 
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We must also emphasize that at non-zero temperatures the lifetime of quasi- 
particles is governed not only by their intrinsic instability but also by their col- 
lisions with one another. The decay from both causes must be weak if the con- 
cept of quasi-particles is to continue to be meaningful. 


§ 37. Temperature Green’s functions 


To construct diagram techniques for calculating the Green’s function at 
non-zero temperatures, it would be necessary to change from the Heisenber g 
representation of the y operators to the interaction representation, as in §12. 
This would again lead to an expression differing from (12.12) only in that the 
averaging is not with respect to the ground state. This is, however, a very im- 
portant difference: the averaging of the operator §-? can no longer be separated 
from that of the other factors as was done in going from (12.12) to (12.14). 
The reason is that a state other than the ground state is not converted into itself 
by the operator 5~1, but into some superposition of excited states having the 
same energy (which includes the results of all possible mutual scattering pro- 
cesses of quasi-particles). This causes a considerable complication of the dia- 
gram techniques, and new terms arise from contractions involving also y oper- 
ators from $7}, 

We can, however, alter the definition of the Green’s function in such a way 
that such complications do not occur. The mathematical formalism based on 
this definition, which was developed by T. Matsubara (1955), is especially 
suitable for calculating the thermodynamic quantities of a macroscopic system. 

We define Matsubara y operators by* 


UM(r, r) = ef’ 5 (r) e-h, | (37.1) 
Mer, r) = F Gi e”, 
where 7 is an auxiliary real variable; these operators formally differ from the 
Heisenberg ones in that the real variable t in the latter is replaced by the ima- 
ginary variable —it.* A similar change (WV — ¥™”, + — WM, id/at + — 8/6), 
for example in (7.8), gives the equations satisfied by the operators (37.1). With 
these operators, a new Green’s function @ is defined similarly to the definition 
of the ordinary Green’s function G in terms of the Heisenberg y operators: 


GauplT1, F15; T2, Yo) = —(T, Mr, rı) M(t, r2)}, (37.2) 


t In this section we shall write the formulae simultaneously for Fermi systems and for 
Bose systems (above the A-point). When there are alternative signs, the upper signs correspond 
to Fermi systems and the lower ones to Bose systems. The spin indices are to be omitted for 
Bose systems. 

~ It must be emphasized that, because of this change, the operator ¥™” is not the same 
as Yur 
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where T, is the “t-chronological operator”, which places the operators from 
tight to left in order of increasing t (with change of sign when operators are 
interchanged for Fermi systems); the brackets <. . .ẹ denote averaging over the 
Gibbs distribution. This averaging may be written explicitly if the definition 
(37.2) is expressed as 


Gap =—tr (0T, PMC) Par} = exp (FP), 7.3) 





where tr denotes the sum of all the diagonal matrix elements. This is called a 
temperature Green’s function, in contrast to the “ordinary” function G, called in 
this connection a time Green’s function. 

Like G,, Gg for a non-ferromagnetic system in the absence of an external 
magnetic field reduces to a scalar: G,, = @6,,. For a spatially homogeneous 
system, its dependence on r; and rs again reduces to a dependence on the dif- 
ference r = rı—r2. 

It is also easily seen that, by its definition (37.3), @ depends only on the differ- 
ence T = T;—T. For example, let Tı < T2; then’ 


G = tye tr {e-fir em alr) e- C247) & alr) emrin, 


or, with a cyclic interchange of factors in the trace, 


a= ty tr fers) Ê ot (r2) ef or}, t<0, (37.4) 


which makes evident the truth of the statement. 
The variable t in practice takes values only in a finite range 


~1T =t <1/T. (37.5) 


The values of G(r) fort < 0 and t > 0 are related in a simple manner. When 
T = Tı— T > 0, we find, similarly to the derivation of (37.4), 


G = ay eT tr fe Gn) eÈ Dira) 


and comparison with (37.4) gives 


GQ) =FG(r+1/T), t< 0; (37.6) 


from (37.5), the argument of the function on the right is positive when t < 0. 


t The factor 2 in parentheses applies to Fermi systems; it is to be replaced by 1 for Bose 
systems. 
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Let us now expand G(z, r) as a Fourier integral with respect to the coordi- 
nates and a Fourier series in t (over the range (37.5)) :* 


— (p .¥—¢,) P Pe 
GenaT FS [om acan os 31.7) 
for Fermi systems 
Es = (2s+1) aT, (37.8a) 
and for Bose systems 
ts = 2saT, (37.8b) 


s= 0, +1, £2, ...; the condition (37.6) is then automatically satisfied. The 
inverse transformation to (37.7) is 


GCs, p) = T f e~e G(r, r) dx dr; (37.9) 
0 


the integral over the range —1/T <t <1/T is converted into one from 0 to 
1/T, using (37.6) and (37.8). 

Calculations similar to those in §36 enable us to express G(C,, p) in terms of 
the matrix elements of the Schrödinger y operators, with the result 


Gt, p) = EÈ S wy Arn P— Kenn) 


(2) hn bo,  (1EeaT). (37.10) 


Hence we see, first of all, that 


G(—Ss, P) = Z*(Cs, p). (37.11) 


Next, comparing (37.10) with the expansions (36.6) and (36.20) for G®, we 
find that l 
GCs, P) = GRC, p), Cs >0. (37.12) 


The condition ¢, > 0 is due to the fact that the expressions (36.6) and (36.20) 
are immediately valid only in the upper half-plane of w, as explained in §36. 
Similarly, we find that G(¢,, p) = G4(iz,, p), ¢, < 0. Thus the temperature 
Green’s function in Fourier components is the same as the retarded or ad- 
vanced Green’s function at discrete points on the imaginary w-axis. In particu- 
lar, this result leads at once to an expression for the temperature Green’s func- 
tion in an ideal gas: replacing w by it, we find from (36.17) 


Gts p) = [eset] (37.13) 


In the next section, the diagram technique for calculating the function 
G (É, p) will be described. To determine G*(«, p) (and therefore, in particular. 


T This device is due to A. A. Abrikosov, L. P. Gor’kov and I. E. Dzyaloshinskii (1959) and 
E. S. Fradkin (1959). 
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to determine the energy spectrum of the system), we must construct an analytic 
function equal to G(C,, p) at the points œ = it, and having no singularity in the 
upper half-plane of w. This procedure is unique if we add the requirement 
that G¥(w, p) — Oas [œ| — œ ; see (36.11). Nevertheless, in specific cases such 
an analytical continuation may involve some difficulties. It is, however, unne- 
cessary in calculating the thermodynamic quantities. 

For example, to calculate the potential 2, we can start from the expression 
for the density matrix averaged over the Gibbs distribution, 


Noup(¥1, r2) = + Gap(T1, r1; T1+0, r2), (37.14) 
which is evident from the definition (37.2); cf. (7.17). Putting re = rı and sum- 


ming over « = £, we find as the density of the system 


N ay 
y= tr È || @G.mew -E (37.15) 


S =—0oo 





This expression determines N as a function of u, T and V, and Q(u, T, V) is 
then calculated by integrating the equation N = —ôQ/ðp. 


§ 38. The diagram technique for temperature Green’s functions 


The diagram technique for calculating the temperature Green’s function @ 
is established in a similar way to that in §§12 and 13 for the time function G. 
The fact that the definition of the Matsubara y operators (37.1) differs from 
that of the Heisenberg operators only in the formal replacement of it by Tt 
enables us to make considerable use of direct analogy. 

First of all, let us define the Matsubara operators in the interaction represen- 
tation; they differ from (37.1) in that the exact Hamiltonian A’ is replaced by 
the free-particle Hamiltonian A): 


P(r, r) = exp (tHo) $.(r) exp (— tH). (38.1) 


The relation between the operators ¥ and Y™ is given by the Matsubara 
S-matrix, constructed similarly to (12.8): 


G(T, Tı) = T; exp | = í V(t) dr) > (38.2) 
where j 
PAT) = exp (t) V exp (— thg) (38.3) 


is the interaction operator in that representation. But, whereas in §12 the rela- 
tion between Y and Po was established with the initial condition that the inter- 
action was “switched on” at t = — œ, the “initial” condition must now be 
that ¥” and ¥™ are the same at t = 0. Accordingly, we have instead of 
(12.11) 

YM (rt) = 6-17, 0) YM(z) G(x, 0). (38.4) 
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We substitute this expression in the definition (37.3) of the Green’s function; 
taking the particular case T; > Tə, we find 


Gast, %) = — tr {96-(14, 0) PMCT) 6(t,, 0) 6-t2, 0) PH (ts) 5(t2, O}; 
the arguments r; and rz are omitted, for brevity. Noting that, when Tı > T2 >Ts, 


G(T, T3) = G(T, T2) (To T3), 


G(T, T,)G—1(T3, T1) = G(T, T3), 
we obtain 


GaplT, Ta) = — ny wô- (7 , 0) k (F7 , n) WM (11) ôlTa, T2) PMCT) ê(T2 o|! . 


The factors in the square brackets are already in order of increasing t from right 
to left. We can therefore write 


Gaplty, 12) = — tr {(06-T, PM) PM) ôl}, (38.5) 


where 


ô = ô(1/T, 0). 


It is easily verified that in this form the expression remains valid for T} < Tə 
also. 

In contrast to (12.12), equation (38.5) contains an extra (Gibbs) factor, 
and the averaging is over states of a system of interacting particles. We shall 
show that these two differences cancel out, and a complete analogy with (12.14) 
exists. To do so, we use the formula 


eH — ete G(r, 0), (38.6) 


which is obtained by substituting (38.1) in (38.4) and then comparing the re- 
sulting expression with the definition of Y™ (37.1). By means of (38.6) we can 
substitute in (38.5) 


e*fIT 6-11 /T, 0) = e- ÎŚIT, 


The factor e®” is taken outside the trace, moved from the numerator to the 
denominator, and put in the form 


e-2IT — tre-Î®'T = tr e-®T6(1/T, 0). 


Lastly, multiplying the numerator and denominator by exp(2o/T), where 2o 
is the thermodynamic potential of an ideal gas for the same values of u, T and V, 
we find 


Galt 72) = - Gr (T, PMC) P(x) ôy (38.7) 
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where the averaging is with respect to the states of a system of non-interacting 
particles: 
(,..)9 = tr {W%...}. 


There is an evident analogy with (12.14). 

To change to the diagrams of perturbation theory, as in§13, we expand (38.7) 
in powers of the interaction operator V(t). For a system with pair interaction 
between particles, this operator differs from (13.2) only in that the Heisenberg 
operators W,, W+ are replaced by Matsubara operators YM, ÊM, The mean 
values of the products of y operators are again expanded by Wick’s theorem 
(i.e. by taking all possible ways of contracting pairs of operators); the validity 
of this theorem in the macroscopic limit is proved in this case by the same 
arguments as in §13. 

The rules of the diagram technique thus obtained are entirely analogous to 
the rules derived in §13 for T = 0. The graphical form of the diagrams is exactly 
the same. There is only a slight change in the rules for analytical reading of the 
diagrams. 

In the coordinate representation, each continuous line from point 2 to 
point 1 is associated with a factor —Q@(t1, 11; T2, r2) (with a minus sign). 
Each broken line joining points 1 and 2 corresponds to a factor — U(ri—re) X 
X 6(t1— T2). For all variables t and r of internal points in the diagram, there is 
integration over d3x through all space, and over dt from 0 to 1/T. 

In changing to the momentum representation, we must expand all functions 
@ in the form (37.7). After the integration with respect to all the internal 
variables r, a delta function appears at each vertex of the diagram, expressing 
the law of conservation of momentum (2p = 0). There is also at each vertex 
an integral of the form 


T i exp{—it(ls, + Cs: + s,)} at. 
0 


This integral is (from (37.8)) zero unless 22, = 0, and in this case is equal to 
unity. Thus the law of conservation of discrete frequencies is also satisfied at 
each vertex. Each continuous line is now associated with a factor -G9 
(Č. p); a closed continuous line again has a factor n(u, T), the ideal- 
gas density for given u and T. For each broken line there is a factor — U(q). 
There is integration and summation over all momenta and frequencies that. 
remain undetermined by the conservation laws at all vertices, in the form 


The coefficient of the whole diagram in —@,, is (— 1)” for Fermi systems, 
where L is the number of closed sequences of continuous lines in the diagram. 
For Bose systems, the coefficient is unity. 
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In these techniques also, of course, as when T = 0, we can make a partial 
summation and define various “blocks” in the diagram. In particular, we can 
determine the vertex part, which is expressed in terms of the two-particle Green’s 
function. This vertex part is related to @ by a Dyson equation analogous to 
(15.14). We shall not write out the corresponding formulae, whose derivation is 
entirely similar to that in the diagram technique for T = 0. 

When we make the transition to the case T = 0, the sums over s in the 
Matsubara diagrams become integrals over Ẹ, and the Matsubara techniques 
become very reminiscent of the ordinary ones described in Chapter II. There 
is a difference, however, in that for real ¢ the Matsubara functions are the same 
as the values of G? and G4 on the corresponding halves of the imaginary 
axis; see (37.11), (37.12). In changing to the ordinary technique for T = 0, 
we must also rotate the contour of integration until it becomes the real w-axis. 


CHAPTER V 


SUPERCONDUCTIVITY 


§ 39. A superfluid Fermi gas. The energy spectrum 


THE whole of the Landau theory given in Chapter I applies only to one class 
of Fermi liquids—those whose energy spectrum is not such as to lead to super- 
fluidity. This is not the only possible type of spectrum for a quantum Fermi 
liquid, and we shall now go on to consider Fermi systems with spectra of a 
different kind. The origin of such spectra and their basic properties can be most 
clearly perceived from a simple model which allows a complete theoretical analy- 
sis: a degenerate almost ideal Fermi gas with attraction between the particles.’ 

A slightly non-ideal Fermi gas with repulsion between the particles has been 
discussed in §6. At first sight, the calculations given there are equally valid 
whether there is repulsion or attraction, i.e. whether the scattering length a is 
positive or negative. In fact, however, for the case of attraction (a < 0) the 
ground state of the system thus found is unstable with respect to a certain 
rearrangement that alters its character and lowers its energy. 

The physical nature of this instability consists in a tendency of the particles 
to “pair” by forming bound states of pairs of particles lying near the Fermi 
surface in p-space and having equal and opposite momenta and antiparallel 
spins—the Cooper effect (L. N. Cooper 1957). It is noteworthy that this effect 
occurs in a Fermi gas, however weak the attraction between the particles. 

Because of this effect, the set of operators âp. â, used in the problem of a 
Fermi gas with repulsion, corresponding to free states of individual particles 
of the gas, cannot now serve as a correct initial approximation in perturbation 
theory.? Instead, we must use from the start new operators, which we shall 
seek in the form of linear combinations 


bp- = Uplp— + UpGty, +» | 
bys = Upâp+ — Vpâtp,— 
of the operators of particles with opposite momenta and spins; the suffixes + 


(39.1) 


t This problem is the basis of the theory of superconductivity due to J. Bardeen, L. N. 
Cooper and J. R. Schrieffer (1957). The method of solution given below is by N. N. Bogolyu- 
bov (1958). 

+ An indication of the inapplicability of perturbation theory (in the form used in §6) to 
pairs of particles with spin components + 3 and momenta p, ~ — p; is already given by the 
presence of the singularity at ® = x of the expression given by this theory for the quasi- 
particle interaction function (6.16); this singularity exists only with antiparallel spins, corre- 
sponding to the eigenvalue — 3 of the operator 61.62. 


153 


154 Superconductivity 


and — refer to the two values of the spin component. Because of the isotropy 
of the gas, the coefficients u, and v, can depend only on the absolute magnitude 
of the momentum p. In order that these new operators should correspond to 
the creation and annihilation of quasi-particles, they must obey similar Fermi 
commutation rules to those of the old operators: 


babi + bt Boe = 1, (39.2) 
and all other pairs of operators anticommute (the suffix « labels the two values 
of the spin components). For this to be so, the transformation coefficients must 
be such that 

u+ = 1; (39.3) 

u, and v, may be made real by a suitable choice of the phase factor. The inverse 
transformation to (39.1) is 

Âp+ = Upby, + Vpbtp, _, l 


(39.4 
Âp- = Upby_ = Vpbtp, +e ) 


For the same reasons (the predominant role of the interaction between pairs 
of particles with opposite momenta and spins), we shall retain in the second 
sum in the Hamiltonian (6.7) only terms in which pı = —p2 = p, pj = — Pz =p’: 


7 P at, 8 sf 
H = 2 pr Âg Aye. — V Gy + AZ y, — Gy, — Gps (39.5) 


again with the “coupling constant” g = 4h?\a|/m (the scattering length 
a =< 0). 

In subsequent calculations, it will again be convenient to use the customary 
procedure for avoiding the need to take explicit account of the constancy of 
the number of particles in the system: as a new Hamiltonian, we use the differ- 
ence A’ = Ĥ— uÑ, where 


N = ¥ Gg 


is the particle number operator; the chemical potential is then determined, in 
principle, by the condition that the mean value N is equal to the given number 
of particles in the system. 

We shall also use the notation 


Np = p?/2m— p. (39.6) 
Since u ~ p}/2m, we have near the Fermi surface 
nr = Ur(p— Pr); (39.7) 


where vp = pp/m. Subtracting uÑ from (39.5), we can thus write the initial 
Hamiltonian as 


A' = 5, np âk âp Ds âr Ay, ~ Gp, - Gp+- (39.8) 
P, P,P 
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Here we make the transformation (39.4). Using the relations (39.2) and (39.3) 
and the possibility of replacing the summation suffix p by —p, we obtain 


A’ = 29) nt E nug vp) (ph. Oy. + Bp. bo) 
p p 
+2 D npiptyl bits By, - +b», Br) -77 Be Bos (39.9) 


B, = ug bp, — by, — v2 by, bty, — + vptip(b_p, - b+), - — byt, by +). 


The coefficients u, and v, are now chosen from the condition that the energy E 
of the system be a minimum for a given entropy. The entropy is given by 
the combinatorial expression 


S = — Y, [p log npa + (1 — npa) log (1 —7,,)]- 


The condition stated is therefore equivalent to minimizing the energy for given 
quasi-particle occupation numbers npa 
In the Hamiltonian (39.9) the diagonal matrix elements are zero except for 
terms containing the products 
Ds Bye = nas Oye ba = =n 
Hence 


E = 25) 0,08 + Dg- +n) [E upo — no+ —Mp-)]?- (39-10) 


Varying this expression with respect to the parameters u, and using the rela- 
tion (39.3), we find as the condition for a minimum 


ôE 2 
auo rr —MNys. —Ny_) [2v — E(u — vp) Leyte (1 —Ny 4 — n-)| =0. 
Hence 
2N pupp = A(uz— vz), (39.11) 
where 4 denotes the sum 
As E È upt9(1 — p+ Mp) (39.12) 


From (39.11) and (39.3) we can express u, and v, in terms of n, and 4: 
už 1 (2 Np 
= —(1+—_+—-}. (39.13) 
A 2 | EEE 
Substituting these values in (39.12), we obtain an equation for 4: 


oy l— p+ — 7- _ 
Ve V(4+%) 
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In equilibrium, the quasi-particle occupation numbers are independent of the 
spin direction and are given by the Fermi distribution formula (with zero chemi- 
cal potential; cf. the last footnote to §1): 


np = Mp = My = [7+1]. (39.14) 


Changing also from summation to integration over p-space, we can write this 
equation in the form 
1 1—2n, d?p 
=g | — oes E l. 39.15 
25) PEND nh ee) 
Let us now analyse the relations obtained above. We shall see that 4 plays 
a basic role in the theory of spectra of the type under consideration. We shall 
first calculate its value 4o for T = 0. 
When T = 0 there are no quasi-particles, so that n, = 0 and equation (39.15) 
becomes 
g Axp? dp 
L ee ee 39. 
2(2xh)> ] (A+ nè) eae) 


We may note immediately that this equation certainly could not have a solution 
for Ao if g < 0, i.e. in the case of repulsion, since the two sides would then have 
opposite signs. 

The main contribution to the integral in (39.16) comes from the range of 
momenta where 4o < Up ! pp—p| < vppr ~ l, and the integral is logarith- 
mic; the smallness of Ao relative to u is confirmed by the result. Cutting off 
the logarithmic integral at some 4 = é ~ u, we havet 


~ 


[ RONE <a Le E ORO Vad / A 
J Motol prp vr J (Atna op ©” Ay 
Hence 
(gmpr/2n*h*) log (8/40) = 1, (39.17) 
or 
Ao = Eexp (— 2z?h3/gmpr) = Eexp(—z2h/2pr | al). (39.18) 


This expression may also be written 


Ay = Eexp (—2/gvF), (39.19) 


t When p>>py, 7,%p’, and the integral (39.16) as written diverges as p. In reality, however, 
this divergence is spurious, and is eliminated by renormalizing the relation between the con- 
stant g (i.e. the scattering length a) and the interaction potential, as in §§6 and 25. A consistent 
performance of this quite complicated calculation allows us to determine also the propor- 
tionality factor between the cut-off parameter @ and the chemical potential u: ë = (2/e)"®u 
= 0.49u (L. P. Gor’kov and T. K. Melik-Barkhudarov, Soviet Physics JETP 13, 1018, 
1961). 
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where vp = mpp/n?h? is the energy density of the number of states of a particle 
on the Fermi surface (v de is the number of states in the range de). 

The most interesting feature is the form of the energy spectrum of the system, 
i.e. the energy of the elementary excitations &,, = ¢, = e(p). We can find 
this from the change of energy of the whole system when the quasi-particle 
occupation numbers change, i.e. by varying E from (39.10) with respect to n,,. 
Since the values of u, and v, have already been taken from the condition that 
the derivatives of E with respect to them are zero, the variation of E with respect 
to n,, can be carried out with constant u, and v,. Then 


e = (ÔE/ônN pa)up, vp- 


The calculation of the derivative, using (39.11)-(39.13), leads to the simple 


result 
elp) = y (4? + n8). l (39.20) 


We see that the quasi-particle energy cannot be less than the value 4, which 
is reached when p = pp. In other words, the excited states of the system are 
separated from the ground state by an energy gap. The quasi-particles, having 
half-integral spin, must appear in pairs. In this sense we may say that the gap 
is 24. Since p, | a|/ň < 1, 4o is exponentially small with regard to u. Moreover, 
the expression (39.18) cannot be expanded in powers of the small parameter, 
the coupling constant g; the latter occurs in the denominator of the exponent, 
and so g = 0 is an essential singularity of Ao(g). 

The spectrum (39.20) satisfies the superfluidity condition established in §23: 
the minimum value of ¢/p is not zero. Thus a Fermi gas with attraction between 
the particles must have the property of superfluidity.* 

Figure 5 compares the dispersion relations of quasi-particles in a superfluid 
Fermi system (upper curve) and in a normal one. In the latter, the dispersion 


ig 
| 


A 


P-P- 
Fic. 5. 


t Note, however, that the Landau condition has different meanings for the Bose and Fermi 
spectra. Fot the Bose spectrum, violation of the condition would lead to unlimited growth of 
excitations and there could not be an equilibrium motion of the normal part relative to the 
superfluid part. (This is shown by the fact that the Bose distribution function is negative; see 
the first footnote to §23.) An unlimited growth of Fermi excitations is prevented by the Pauli 
principle, and the presence of a Fermi branch not satisfying the Landau condition need not 
imply the absence of superfluiditv, only that the normal part is present at T= 0. 
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relation is represented by the two straight lines € = vp'p— prl, in accordance 
with the treatment mentioned at the end of §1. 

The magnitude of the gap 4 depends on the temperature, i.e. the form of the 
spectrum itself depends on the statistical distribution of quasi-particles—a situ- 
ation analogous to that of a normal Fermi liquid. Since the quasi-particle 
occupation numbers increase (tending to unity) with increasing temperature, 
it is evident from (39.15) that 4 decreases, and becomes zero at some finite 
temperature T,, at which the system passes from the superfluid to the normal 
state. This point is a phase transition of the second kind, like the A-transition 
in a Bose superfluid. 

The presence of the energy gap in the spectrum of a degenerate Fermi gas is 
a manifestation of the “pairing”effect mentioned at the beginning of this sec- 
tion. The quantity 24 may be regarded as the binding energy of the Cooper 
pair, which would have to be expended in order to break it up. 

The Hamiltonian (39.5) takes account (as already noted in §6) of the inter- 
action only between pairs of particles in the singlet s-state: the orbital angular 
momentum of the relative motion of the particles is zero, and their spins are 
antiparallel. The pairs, having zero total spin, behave as Bose objects and may 
accumulate in any numbers at the level (of their motion as a whole) with the 
least energy, namely that for which the total momentum is zero. In this intuitive 
treatment, the phenomenon is entirely analogous to the accumulation of par- 
ticles in a state with zero energy (Bose-Einstein condensation) in a Bose gas; in 
this case the condensate is the ensemble of paired particles. 

The concept of bound pairs must not, of course, be taken very literally. It 
would be more precise to speak of a correlation between the states of a pair of 
particles in p-space, leading to a finite probability of the particles’ having zero 
total momentum. The spread dp of the momentum values in the correlation 
range corresponds to an energy of the order of A, i.e. dp ~ 4/vz. The cor- 
responding length  ~ #/dp ~ hv,/A determines the order of magnitude of 
the distances between particles with correlated momenta. When T = 0 this 
length, called the coherence length, is 


o ~ hur] 
~ (h/pr) exp (zň/2pr a!l). (39.21) 


Since, in a degenerate Fermi gas, fi/p, is equal in order of magnitude to the 
interatomic distances, we see that & is very large in comparison with these. 
This shows particularly clearly the conventionality of the concept of bound 
pairs. 

The origin of the Cooper effect is closely connected with the existence of the 
Fermi surface which bounds (in p-space) a finite region of occupied states at 
T = 0; an important point is that the energy density of the number of states 
on this surface is not zero. The relationship is evident in formula (39.19) for 
the gap Ao, which becomes zero as vp — 0. 
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§ 40. A superfluid Fermi gas. Thermodynamic properties 


We shall begin the study of the thermodynamic properties of a superfluid 
Fermi gas by calculating the temperature dependence of the energy gap. 
Rewriting (39.15) as 


i dp _ [ ndp 
“1458 AP = | seni i 


we note that the integral on the left differs from that for T = 0 only in that 
/lo is replaced by 4. Hence, using (39.17), we see that the left-hand side is 
(gppin/277h®) log(Ao/4). On the right we substitute n, from (39.14) and change 
to integration over dp = dy/ vp: 


lo z J dn = 


where 


e dx l 
= | er r * 
0 


on account of the rapid convergence of the integral, the limits of integration can 
be extended to + œ. 
At low temperatures (T < Ap) the integral is easilyt calculated to give 


A = Ali —V/(2nT/4o) e747]. (40.2) 


Near the transition point, 4 is small, and the leading terms in the expansion 
of the integral I(4/T) give* 


log (40/4) = log (aT! yA) + 70(3) A2/82T?. (40.3) 


Hence, first of all, we see that A is zero at a temperature 


t For large u, the first term in the expansion of J(u) in powers of 1 / is 
r dx ox 
I (u) = f y eP [-u (1+35)| 
0 


= (2/2u)"? e-*, 


* To expand the integral J(u) when u — 0, we add to and subtract from it the integral 
1 1 1 
h= — - — — 
1=> Í ( O s tanh 7 x) dx. 

Then 7 = I+ I, where 

1 e 1 1 1 1 

to | (amps end vote 
A 5 tanh y x-y tanh 5 Vixi+u)) dx. 

0 


In J, the first term in the integrand is integrated by elementary means, and the second is 
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Te = yAo/a = 0.5740 (40.4) 


which is small in comparison with the degeneracy temperature To ~ u. Then, 
in the first order in T,—T, we obtain 


A=T, lacy (1 -F))" = 3.06T, V(: -7;) , (40.5) 


It remains to calculate the thermodynamic quantities for the gas. Let us 


first consider the region of low temperatures. 
To find the specific heat in this region, it is simplest to start from the formula 


ÔE = Y e(ônp, +6ny_) = 2 }, côn, 
p p 


for the change in the total energy when the quasi-particle occupation numbers 
vary. Dividing by ôT and changing from summation to integration, we obtain 


the specific heat 


mpr if g 
C = V— Hs | aren 


— 00 


When T « A, the quasi-particle distribution function n ~ e~"7, and the quasi- 
particle energy € ~ lo +n?/240; a simple integration gives 


a/2mpr Aj? _ 
C= VSTi © 4o/T (40.6) 


integrated by parts, giving 


_ 1 1 r log x 
meneziz | ants A 


The integral is equal to 2 log (7/2y), where logy = C = 0.577 is Euler’s constant; thus 2/, = 
log (1/yu). 
The integral J, is zero when u = 0. The first term of its expansion in powers of 1” is 


— u? ze | ox (+ tanh z) 
x 
0 
Substituting the expansion 


tanh 4 x = 4x } [n%(Qn+1)?+x7}71, 
n=0 


derived as in the second footnote to §42, we obtain 


= 
Te 


47 F dx oo O LRO 
nad E, | aT E È enno = e BO, 





a 
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Thus, as T — 0, the specific heat decreases exponentially—a direct consequence 
of the presence of the gap in the energy spectrum. 

In subsequent calculations it is convenient to start from the thermodynamic 
potential 2, since the whole analysis is for a given chemical potential of the 
system, not for a given number of particles in it.t We use the formula 


(0.2/6A)r,v,, = (0H/@A), (40.7) 


where / is any parameter characterizing the system (cf. Part 1, (11.4), (15.11)); 
in this case we take as the parameter the coupling constant g, which appears 
in the second term in the Hamiltonian (39.8). The mean value of this term is 
given by the last term in (39.10), which by (39.12) is —VA2g oc g. Hence 


02Q/0g = —VA?/g?. 


As g — 0, the energy gap 4 tends to zero. Hence, integrating this equation with 
respect to g from 0 to g, we find the difference between the thermodynamic 
potential Q in the superfluid state and the value it would have in the normal 
state (4 = 0) at the same temperature :* 


A2 
Q,-Q,=—-V | = dg. (40.8) 


According to the general theorem of small increments (Part 1, (24.16)), the 
correction (40.8), when expressed in terms of the appropriate variables, is the 
same for all the thermodynamic potentials. 

At absolute zero 4 = Ao, and from (39.18) 


dAo/ dg = 2mh? Ao/mprg?. 


Changing in (40.8) from integration over dg to that over d4o, we find the fol- 
lowing expression for the difference between the ground-state energies of the 
superfluid and normal systems: 


E,—E, =—V as 42. (40.9) 





t The chemical potential of the gas itself is not to be confused with the zero chemical poten- 
tial of the quasi-particle gas. 

+ A comment is necessary here in connection with the approximations made throughout. 
When g = 0, no interaction between particles remains in the Hamiltonian (39.8), and one 
might suppose that we then have an ideal Fermi gas, not a “normal” non-ideal gas. In reality, 
however, approximations have already been made in the Hamiltonian (39.8), after which one 
cannot speak of calculating the absolute value of the energy. Interaction terms (which are not 
important in finding the form of the spectrum and the difference 2,—,) have been omitted 
whose contribution to the energy is large compared with the exponentially small quantity 
(40.8); this is the contribution proportional to Ng given by (6.13). 
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The negative sign indicates that, as mentioned at the beginning of this section, 
the “normal” ground state is unstable when there is attraction between the gas 
particles. The difference (40.9) per particle is ~ A?/p. 

Let us now take the opposite case, T — T,. Differentiating (40.3) with respect 


to g, we find 


From here we substitute dg/g? in (40.8), regarding it as the difference of free 


energies: 
4 


_ _y 16(3)mpr f 4s 
Fs — Fn = —V gare A dA 


and finally obtain, using (40.5), 


+ y2mprT? T \? 
F- F, = -V (1 7) (40.10) 


The difference of entropies is therefore 


= 4mprT, T 
Ss — Sp T V TOR ( zg) . 


As T — T „ the difference of specific heats tends to a finite limit, 


= 4mprT. 
Ci—Cn =V TR’ (40.11) 


i.e. there is a discontinuity at the transition point, with C, > C„. The specific 
heat of the normal state is given (in the first approximation) by the ideal-gas 
formula (see Part 1, (58.6)); expressed in terms of pp, it is C, = Vmp,T/3%. 
The ratio of specific heats at the transition point is therefore 

CAT.) 12 


CT) ~ FE *! = 2.43. (40.12) 





As regards its superfluidity, the gas is characterized by the division of its 
density ọ into normal and superfluid parts. According to (23.6), the normal part 
of the density is 

87 dn 
e ae ee 
On = — 300i? ae P 
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The total density of the gas is related to pr by 

e = mN/V = 8xphm/3(22h)°. 
Hence 


Qn __»f{ an 
v2 | Tan. (40.13) 
0 


This integral does not need to be calculated specially, since it can be reduced 
to the known function A(T). Differentiating (40.1) with respect to T and com- 
paring the resulting integral with (40.13), we see that 


D A 
P -FF (40.14) 
Substituting here the limiting expressions (40.2) and (40.5), we obtain 
$ On _ Ao 1/2 —d/T 
T+o: t= a en Au, (40.15) 
p @ 2 9()-2 
TT: & = 2(I z) (40.16) 


Lastly, two comments are needed concerning the range of temperature in 
which the above formulae are valid. 

As the transition point T, is approached, processes of interaction of quasi- 
particles (not taken into account in the above theory) become important; they 
are responsible in this case for the occurrence of the singularities of the thermo- 
dynamic quantities that are characteristic of phase transitions of the second 
kind. Sufficiently close to such a point, the formulae derived above must 
become invalid. However, because of the presence of a small parameter (the 
coupling constant g) in the model considered, this happens only for extremely 
small values of T,—T; we shall discuss this in more detail in §45. 

As in a superfluid Bose liquid, sound can be propagated in the Fermi gas 
under consideration (unlike one with repulsion; cf.§4), with a velocity u ~ p,/m 
determined in the usual manner by the compressibility of the medium. This 
means that, as well as the Fermi-type excitation spectrum dealt with here, the 
spectrum of such a gas also contains a phonon (Bose) branch. The specific 
heat due to phonons is proportional to T3 with a small coefficient, but as T — 0 
it must ultimately predominate over the exponentially decreasing specific heat 
(40.6). 
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§ 41. Green’s functions in a superfluid Fermi gas 


Let us now set up the mathematical formalism of Green’s functions for appli- 
cation to superfluid Fermi systems.’ 

We have seen in §26 that, in terms of y operators, the Bose-Einstein conden- 
sation in a Bose system is expressed by the existence of non-zero limiting values 
(as the particle number N —> œ) of the matrix elements between states that 
differ only in that N changes by unity. The physical significance of this statement 
is that the removal or addition of one condensate particle does not alter the 
state of a macroscopic system. 

For a superfluid Fermi system, the same must be true of the condensate of 
Cooper pairs: the state of the system cannot be altered when the number of 
pairs in the condensate changes by unity. This is expressed mathematically by 
the presence of non-zero limiting values (N — œ) of the matrix elements for 
the product ¥,(X2) P (X1), the two-particle annihilation operator, and its 
Hermitian conjugate, the pair creation operator ¥+(Xj) wy (X2). These matrix 
elements relate the “like” states of systems, differing only by the removal or 
addition of one pair of particles: 


lim (m, N| P(e) Pa(Xa) |m, N+2) 


= lim (m, N+2| P(X) Ë (X2)! m, NY # 0. (41.1) 


We shall henceforward omit the symbol for taking the limit, and for brevity 
also the diagonal matrix suffix m which labels the “like” states of systems with 
different numbers of particles. 

As with Bose systems (§31), the mathematical formalism of Green’s functions 
for superfluid Fermi systems involves several different functions. Together 
with the ordinary Green’s functions 


iGap( Xi, X2) = (N | TY(X1) PF (X2) |) (41.2) 
we need also the “anomalous” functions defined by 


iFap(X1, X2) = (N| TL(X1) PAX) N +2, 


iFei(Xi, Xa) = (N+2 ITA) PHX) IN) oe 


Since each of the functions F,, and F} is composed of two equal operators, 
Fap(X1, X2) = — Fax(X2,X1), Fen Xi, X2) = — Fa(X2, Xi). (41.4) 


The interchange of Fermi y operators with the factors in chronological order 
causes a change in the sign of the product. 


t The technique described in this section is due to L. P. Gor’kov (1958). 
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According to the fundamental principles of statistical physics, the result ofsta- 
tistical averaging does not depend on whether it is with respect to the exact wave 
function of a stationary state of a closed system or by means of the Gibbs 
distribution. The only difference is that in the first case the result of the averag- 
ing is expressed in terms of E and N, the energy and number of particles in the 
body; in the second case, in terms of T and u, the temperature and the chemical 
potential. The first method is the more convenient for the discussion that fol- 
lows in this section. 

In the model of a Fermi gas considered in §39, the bound pairs are in a singlet 
state. The spin dependence of the matrix elements of the creation and anni- 
hilation operators of such a pair reduces to a unit antisymmetric spinor: 


Bap = (a o): (41.5) 


The functions (41.3) may be written? 
Fig = 8apF(X1, X2), Fi = 8«pF+ (X1, X2); ' (41.6) 


from (41.4), F and Ft are symmetrical in X, and X2. The spin dependence of 
the Green’s function G,, for a non-ferromagnetic system reduces to a unit 
matrix 6,5: 

Gap = O2pG. 


In a homogeneous system macroscopically at rest, the Green’s functions G, F 
and F* depend only on the differences of the coordinates of the points and the 
difference of times (see the sixth footnote to §31). 

Just as the function 2(X) defined in §26 had the sense of a wave function for 
particles in the condensate, so the function iF(t, r1; t, r2) may be regarded as 
the wave function of particles bound in Cooper pairs in the condensate. Then 


the function 
5(X) = iF(X, X) (41.7) 


will be the wave function for the motion of these pairs as a whole. From the 
definitions (41.3) and (41.5) it is easily seen that then F+(X, X) = i&*(X). Ina 
stationary system macroscopically at rest, the function Z(X) reduces to a 
constant, which may be made real by a suitable choice of the phases of the y 
Operators. 

Let us now calculate the Green’s functions thus defined for the model of a 
Fermi gas with weak attraction between the particles. 

The Heisenberg y operator satisfies the equation (7.8). Because the range of 
the forces between particles in the gas considered is small, in the integral term 


t Cf. the fifth footnote to §7. Whereas G,¢ in its spin structure is a mixed spinor of rank two, 
Fg and Fg are contravariant and covariant spinors respectively. 
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in this equation we can take the factors Yt, r’) at the point r’ = r and bring 
them outside the integral. The equation then becomes‘ 


w, 
je =- (Sar +n) Pgh BW, (41.8) 


By taking the Hermitian conjugate of each term in this equation, we get a 
corresponding equation for the operator Ê+: 


p+ 
joe = (Sm n) Pt 4 phi PW, (41.9) 


Substituting (41.8) in the derivative 0G,,/0t (9.5), we obtain the equation 


0 A ; 
(iar tam +) Gul X — X') 


—ig(N|T P(X) F(X) F(X) PF (X)|N) = bp 5O(X—X); (41.10) 


cf. (15.12). The diagonal matrix element of the product of four y operators 
can be written out, by the matrix multiplication rule, as a sum of products of 
matrix elements of two pairs of operators. Of all such products, we keep only 
those containing matrix elements for transitions in which the change in the 
number of particles is N «+ N+ 2, and omit all other terms: 


(N| TY P P P NY ~ (N| TEL, |N4+2)(N4-2(T P BIN) 
= — F(X, X) F(X, X') = — 8,9 F(0) FH(X-X"); (41.11) 





the expressions (41.5) are used to derive the last formula. This term corresponds 
physically to the pairing of particles, and it has the same order of magnitude as 
the condensate density. 

We must emphasize, however, that there is a fundamental difference from 
the approximations used for a slightly non-ideal Bose gas. In the latter, almost 
all particles are in the condensate at T = 0, and the number of above-conden- 
sate particles, which occur only because of the weak interaction of the particles, 
is relatively small. In the present case, on the other hand, the condensate itself 
is due to the weak interaction, and therefore contains only a small fraction of 
the particles. In other words, the terms omitted in making the substitution 
(41.11) are large, not small, compared with those retained. The latter, however, 
give rise to a qualitatively new effect, a change in the nature of the spectrum, 
whereas the former would be needed only to calculate the corrections (which 
are of no interest here) to the ground state of the system; cf. the last footnote 
to §40. 


t As in §39, we use the notation g for the coupling constant,equal to —Uy = — f Ud?x. 
In §§41 and 42, we have put # = 1. 
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After the substitution (41.11), equation (41.10) becomes 


(i TAE Sm +h) G(X) + gEF*(X) = U(X); (41.12) 


the argument X—-X’ is replaced by X, and the constant iF(0) is denoted by £, 

in accordance with the definition (41.7). Here there are two unknown functions, 

G(X) and Ft(X), and another equation is therefore needed to calculate them. 
It may be found by calculating the derivative 


_OF(X—X' ov (X i 
PEMET) ae (N2 tS win NY; 


a delta-function term (similar to the second term in (9.5)) does not arise here, 
since the function F(X- X’), unlike G,,(X—X’), is continuous at t = f.t 
Substituting (41.9) and again separating the condensate term as in (41.11), we 
obtain the equation 
CEE ) F+(X)+82Z*G(X) = 0 (41.13) 
Ot 2m i : 

It contains the same two functions G and F* as (41.12); the two equations are 
therefore sufficient to calculate these functions. To calculate F, a further equa- 
tion would have to be derived in a similar way. 

In these equations we can change to the momentum representation by using 
the Fourier components G(P) and F*(P) in the customary way: 


(w— np) G(P)+g5F*(P) = 1, a (41.14) 
(w +np) F+(P)+8&"G(P) = 


where P = (w, p)and n, = p?/2m— u. Since F*(Y)is an even function, so are its 
Fourier components: F*(P) = Ft(—P). 
Eliminating F* from the two equations, we find the equation for G 


(w?—42— A?) G(P) = w+ np, (41.15) 


with the notation 
A=g|i. (41.16) 


The formal solution of (41.15) is 


_ w+ _  & v? 
GP) = aeo) = o) oF” Paan 


where e(p) = y (4? +n?) and u, and v, are given by (39.13). It is evident from 
this that the spectrum of elementary excitations, determined by the positive 


t This is easily seen by calculating the discontinuity of Fæ in the same way as for G,g in§9, 
and noting that the operators Pir, r)and Prt, r’) anticommute. 
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pole of the Green’s function, is given by e(p), and we recover the result (39.20). 
We also see that the energy gap 4 and the modulus of the condensate wave 
function for the motion of pairs as a whole are proportional to each other. 

The expression (41.17) for G(P) is, however, not yet complete, since the man- 
ner of passing round the poles has not been defined. That is, the imaginary 
part of G is not yet determined; it contains the delta function 6(w+«), and 
therefore disappears on multiplication by w?— e? in (41.15). 

When T = 0, the rule for passing round the poles is established by direct 
comparison of (41.17) with the expansion (8.7): in terms with positive and 
negative poles, the variable is to be replaced by w+i0 and w—i0 respectively; 
then (41.17) becomes 
eee o 

(®, p) = w— &(p)+i0 t w@ + &(p)—i0 


a @ + Np 
~ (w—e+i0)(w+e—i0) ` GS 


Now expressing Ft by means of the second equation (41.14), we find 


ere o see 
Fro, P) = eH i0)(@+e—10) ` AR 


But, by definition, 


i&* = FH(X = 0) = f f F+ P (41.20) 


Substituting (41.19), we integrate with respect to w by closing the contour 
with an infinite semicircle in the upper half-plane, and so express the integral 
in terms of the residue at the pole œw = e. Then, after cancelling £*, we find 
equation (39.16) for Ao. 

When T + 0, it is somewhat more complicated to determine the imaginary 
part of the Green’s functions. To construct G(w, p) with the correct analytical 
properties with respect to the variable w, we first write down the retarded func- 
tion G¥(w, p); it must be analytic in the upper half-plane, and is therefore 
obtained from (41.17) by the substitution œ - w+i0. The imaginary part of 
this function is 

im GÈ = —q[u2d(w — e) + v2d(w + e)]. 


The imaginary part of the required function G is found from this by means 
of (36.14), which gives 
im G(w, p) = tanh (w/2T) im GF (w, p) 
= — (1 — 2r) m[uZd(w — e) — 035(w + e)l, 


where n, is the Fermi distribution function (39.14); by using this formula, 
we change from averaging with respect to a given stationary state of the system 
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to averaging over the Gibbs distribution. The function G with this imaginary 
part may be written 


= p? tning ~ 8) — 038 1.21 
G(w, p) = eoe n we eco + 2nin,[u26(w — e) — v26(w +e)]. (41.21) 


We now find for the function Ft (a, p) 


F+(@, p) = [F+ (o, Plr- 0- 2 aoe) +8(o+6)}, (41.22) 


where the first term is the function (41.19), relating to T = 0. Substituting this 
expression in (41.20) and carrying out the integration, we return to equation 
(39.15) for A(T). 

Equations (41.14) can be put in diagram form, similarly to the representation 
of equations (33.7) for a superfluid Bose system. The functions G, F and F* are 
represented by the same graphical elements (33.6)—one-way and two-way arrows. 
The two equations (41.14) are written 


P P P -P P (41.23) 
-P P =P P 


A thin arrow corresponds to a factor iG(P), where G(P) is the Green’s func- 
tion of an ideal Fermi gas. The wavy lines entering and leaving a vertex corre- 
spond to factors ig and — ig" respectively. Comparing (41.23) and (33.7), we 
see that these latter factors correspond to the self-energy functions ios and i£ 
respectively, i.e. are first approximations to these quantities. The new elements 
(two-way arrows, wavy lines) are the only special features of the diagram tech- 
nique for superfluid Fermisystems; unlike the case of Bose systems, “triple” 
vertices do not appear. The diagram technique is therefore much simpler here 
and closer to the “ordinary” kind than for superfluid Bose systems. 


§ 42. Temperature Green’s functions in a superfluid Fermi gas 


In §41 we have determined the energy spectrum of a superfluid Fermi gas by 
using the ordinary time Green functions. However, in order to solve more 
complex problems (in particular, to investigate the properties of the system in 
external fields), it is more convenient to use the mathematical formalism of tem- 
perature Green’s functions (A. A. Abrikosov and L. P. Gor’kov 1958). 

The temperature function @,, is defined by the same formula (37.3) as for 
a normal Fermi gas. The temperature functions (F, and Fup (corresponding 
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to the time functions F,, and F;j,) will be defined by the analogous formulae 


(Fap(T1, 043 To, Yo) = 2 (m, N |T; MYM | m, N+2), 


42.1 
Fut, Fis To, Ty) = 2, (m, N+2 | WT, PMPM |m, N). ( ) 


The spin dependence of these functions is separated (as in (41.6)) in the form of 
factors gg: 


(Fap = Bap F, Fup = — gap T- (42.2) 


Like @, the functions (F and (F depend only on the difference t = t1—T2, 
and satisfy the relations (37.6) with the upper sign: 


Ft) =-Fe tT), Flr) =—Fle+ 1/7). (42.3) 
The Fourier series in t for these functions therefore contain only odd “frequen- 
cies” (37.8a):¢, = (2s+1) aT. 
The Matsubara y operators for t = 0 are the same as the Heisenberg opera- 
tors for t = 0: 
M(t = 0, r) = M(t = 0, r) 
Comparing the definitions of F, (F with those of F, F*, we thus find that 
FO, r; 0, r) = Fr), (FO, r; 0, r) = £*(r), (42.4) 


where & 1s to be understood as the condensate wave function averaged over the 
Gibbs distribution, i.e. expressed in terms of the temperature of the system. 

We shall show how the temperature Green’s functions may be used to obtain 
again the energy spectrum of a superfluid Fermi gas at non-zero temperatures. 

The equations for the temperature functions @, F, GF are derived in an exact- 
ly analogous way to equations (41.12) and (41.13); differentiation with respect to 
t replaces that with respect to t, and equations (41.8) and (41.9) are replaced by 
others which differ by the substitution of z for it. As in (41.11), we separate from 
the mean value of the product of four Matsubara y operators the terms con- 
taining matrix elements for transitions in which the number of particles changes 
by 2. The resulting equations are 


ð 
(- Or ope int) G(t,r;1' rT) +g BF (t, r; 7’ , r’) = Òt =r’) (r—r’), 


0 
eae Atn)? (F(T, £; t’, r) —g8* G(T, r; Tt’, r’) = 0. 
(42.5) 


After the change to Fourier components, these equations become 


(its — N) Gls P) +2EF Cs p) = 1, | (426) 
— (ils + Np) F (Css P) -—2E*G(Cs, p) = 0. 


t The different signs in the definitions of (Fand F (in contrast to the same signs in (41.5)) 
are appropriate because the factor 7 in (41.3) does not appear in (42.1). 
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The solutions are 


is +n 
Ges, p) m -are 9 (42.7) 
Flts P) = gE*/(C? +) = F+ (its, P), (42.8) 


where again £? = 4?+7?, A = g&; this solution is uniquely defined, and con- 
tains no, delta functions, unlike G and F*. 

The condition which determines the energy gap in the spectrum is now ob- 
tained from the equation 


= F(t =0,r = 0) = r Ž_ Flts, P) ay Joe 


or, after the substitution of (42.8), 
oo 3 
gT ap l (42.9) 


aF, 2.) E ~ 


The summation with respect to sis given by the formula‘ 


28 [(2s + 1) 7x? +a?]-1 = an tanh ya (42.10) 
and leads to 
1 1 e dP _ 
Le | Stank or Gos = L (42.11) 


in agreement with (39.15). 


§ 43. Superconductivity in metals 


The phenomenon of superconductivity in metals is a superfluidity of the 
electron Fermi liquid in them, similar to that of the degenerate Fermi gas con- 
sidered in the preceding sections. Of course, in many important respects the elec- 
tron liquid and the Fermi gas are quite different physical systems. The basic 
physical aspects of the energy spectrum are, however, the same for both. Let us 
examine qualitatively which features of the above model can be applied to 
electrons in metals, and to what extent. 


t This may be derived by writing 
esse Soe ee os l ] 
(2s+1)? n?+a? 2a | a+in(2s+1) © a—in(2s+1) 


1 
2 

L az f en zje- + DE 1 ens + D2] dy 
0 


and summing the geometrical progression before integrating. 
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An important property of a metal is the anisotropy of its electron energy 
spectrum, in contrast to the isotropy of the spectrum for the Fermi gas consid- 
ered above. This does not, however, prevent the occurrence of the Cooper effect, 
which depends only on the existence of a sharply defined Fermi surface (of 
whatever shape) and a finite density for the number of states on that surface. It is 
also necessary that electrons with opposite momenta and spins should have the 
same energy, i.e. Should both be on the Fermi surface. This condition automati- 
cally follows from symmetry under time reversal. We may say that the electrons 
are paired in states that are obtained from each other by time reversal. 

Next, there is the question of the sign of the interaction of the electrons in a 
metal. In a very simplified way, we may say that this interaction is made up of 
the Coulomb repulsion, screened at interatomic distances, and the interaction via 
the lattice. The latter is describable as resulting from the exchange of virtual 
phonons, and is attractive (§64). If this interaction preponderates, the metal will 
be a superconductor at sufficiently low temperatures. 

It is important to note that the interaction by phonon exchange involves only 
electrons in a comparatively thin shell of p-space near the Fermi surface, whose 
thickness (~ hap, where wp is the Debye frequency of the crystal) is small in 
comparison with the electron chemical potential u. Hence, if we describe the 
superconductivity by a model of a slightly non-ideal Fermi gas, the cut-off pa- 
rameter é in (39.19) is to be taken as 


instead of ë ~ u. 
As to the assumption regarding the weakness of the interaction, we in fact 
have for all actual superconductors 


T. « hwp « u. (43.2) 


The assumption made in §39, however „embodies something further, namely that 
the coupling constant g is small, and therefore that the dimensionless exponent 
in (39.19) is large. In the present case, this condition is expressed as 


log (hwp/T.) > 1; (43.3) 


not only the ratio iw)p/T, but also its logarithm must be large. In practice, this 
condition is considerably less well satisfied.* 

When all the actual differences between the electron liquid in a metal and the 
model of a slightly non-ideal Fermi gas are taken into account, the theory of 
Superconductivity becomes very complicated. It is, however, found that even 
a simple theory based on this model gives in many respects a good description 


t This, incidentally, eliminates the problem of the divergence of the integral (39.16) for 
large momenta (cf. the last footnote to §39). 
t The ratio fw p/T, varies between about 10 for lead and 300 for aluminium and cadmium. 
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of the properties of superconductors, both qualitatively and quantitatively. As 
already mentioned, this theory is due to Bardeen, Cooper and Schrieffer; the 
model of a Fermi gas with weak attraction between the particles is therefore 
known as the BCS model. 


§ 44. The superconductivity current 


The two types of motion in an electrically neutral superfluid (liquid helium) 
correspond, in a superconducting metal, to two types of electric current that 
can simultaneously flow in it. The superconductivity current transfers no heat 
and involves no dissipation of energy; it can exist in a system in thermodynamic 
equilibrium. The normal current is associated with the evolution of Joule heat. 
We shall denote the two current densities by j, and j,; the total current density 
j=i,+i,- 

Several important conclusions about the properties of the superconductivity 
current can be drawn regardless of any particular model, simply from the exist- 
ence of a new macroscopic quantity, the condensate wave function 5(t, r). 

Asin §26, we use the phase ® of this function: 


hy 


(t, r) = |B ei, (44.1) 


Just. as, in liquid helium, the gradient of Ø determines the velocity v, of the 
superfluid flow by (26.12), so in a superconductor the gradient of the phase 
determines the observable quantity, the superconductivity current density. Be- 
cause of the anisotropy of the metal, the direction of j, does not in general coin- 
cide with that of v®, and the components of these vectors are related by a ten- 
sor of rank two. To avoid inessential complications, however, we shall here 
consider only a metal crystal having cubic symmetry. 

The tensor then reduces to a scalar, and there is simple proportionality be- 
tween j, and V®, which may be written 


js = (eh/2m)n, V®. | (44.2) 


Here, by definition, e = —|e| is the electron charge and m its (actual) mass. 
The quantity n, thus defined, a function of temperature, is called the number 
density of superconducting electrons, and acts here as an analogue of the density 
of the superfluid component in liquid helium. It must be emphasized that this is 
not the same as the density of the condensate of Cooper pairs, just as in liquid 
helium g, is not the density of condensate atoms.' 


t The coefficient in (44.2) is written in such a way that in a free superfluid Fermi gas (BCS 
model) mn, is equal to ọ, as calculated in §40. The latter is defined so that the current j, must 
be expressed as en,v,, where v, is the velocity of superfluid motion. In turn, v, is related to the 
phase gradient by v, = (f/2m) V®; twice the mass occurs here (instead of m as in (26.12)) 
because the condensate consists of paired particles. 
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Formula (44.2), like (26.12) for liquid helium, presupposes that the phase varies 
sufficiently slowly in space. However, in a Bose liquid Ø had to vary only slightly 
over interatomic distances, but the condition here is considerably stronger. 
The characteristic dimension for a superfluid Fermi liquid is the coherence 
length o ~ fivg/Mo, and the phase® must vary only slightly over this distance, 
which is large in comparison with interatomic distances. We must emphasize 
that this is a constant (not temperature-dependent) length parameter $o. A rig- 
orous justification of the above condition will be given later (see the end of §51). 

The relation between j, and® becomes more complicated if the superconduc- 
tor is in an external magnetic field. We shall consider here the case of a field 
constant in time. The necessary changes in formula (44.2) can be ascertained 
from the condition that the theory is gauge-invariant. 

This condition states that all observable physical quantities must remain un- 
changed by a gauge transformation of the vector potential of the magnetic 


field: 
A~+A+x(r), (44.3) 


where y(r) is an arbitrary function of the coordinates. The y operators are trans- 
formed in the same way as the wave functions: 


$ — P exp (ieg/hħc), + ~ 2+ exp(—ieyx/fic), (44.4) 


where e is the charge of the particles described by the y operator; see OM, 
(111.9). The Green’s functions G(X, X’) and F(X, X’), as matrix elements of the 
products ¥Y"* or P$”, are transformed according to 


G, X) = exp | Fae) — x1} G, X), 
(44.5) 


F(X, X’) > exp ea [x(r) szen} F(X, X’). 


Here 
& = iF(X, X) + exp (2iex/hic) Z, 


i.e. the phase of the condensate wave function 
D — D+ (2e/hc) x(t). (44.6) 


The relation (44.2) is not invariant under such a phase transformation. To 
obtain the required invariance, this relation must include a further term contain- 
ing the vector potential of the magnetic field: 


eh 


j; = Im” (70-74) . (44.7) 


fic 


t Since the y operators appear in the second-quantized Hamiltonian (7.7) as pairs W(X) 
and P+ (x), it is transformed by the changes (44.3), (44.4) in the same way as the ordinary 
Hamiltonian for a similar transformation of ordinary (not operator) wave functions. A trans- 
formation in the form (44.3), (44.4) has in fact already been used in §19. 
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The doubling of the charge in this term corresponds to the pairing of electrons 
in the superconductor. 

This expression is now sufficient to account for the fundamental macroscopic 
property of a superconductor; the displacement away from it of a magnetic 
field (the Meissner effect).* 

Let us consider a homogeneous superconductor in a magnetic field that is 
weak compared with the critical field H, at which the superconductivity is lost. 
This condition excludes any significant influence of the magnetic field on the 
value of n,. Let the body be in a state of thermodynamic equilibrium, so that 
there is no normal current and j, = j.* Taking the curl of both sides of (44.7) 
and noting that curl A = B, the magnetic induction in the body, we get the 
London equation 

curl j = —(e?n,/mc) B (44.8) 


(F. and H. London 1935).8 
This equation is specific to superconductors. We shall also make use of the 
general Maxwell’s equations 


curl B = (4z/c) i, (44.9) 
divB = 0. (44.10) 


Substituting j from (44.9) in (44.8) and noting that, from (44.10), curl curl B = 
= — AB, we obtain an equation for the magnetic field in a superconductor: 


AB = B/8?, (44.11) 


where 
62 = mc?/4xen,. (44.12) 


We can use (44.11) to find the field distribution near the surface (assumed 
plane) of a superconductor. The surface is taken as the yz-plane, with the x-axis 
into the body. In these conditions, the field distribution depends only on one 
coordinate, x, and (44.10) gives dB,/dx = 0; then, from (44.11), we necessarily 
have B, = 0. Equation (44.11) now becomes @?B/dx? = B/d?, whence 


B(x) = $e-*", (44.13) 


where the vector § is parallel to the surface. 

We see that the magnetic field decreases exponentially into the superconduc- 
tor, penetrating only to distances ~ 6. This distance is macroscopic, but small 
compared with the usual dimensions of solid objects (6 ~ 1078-1075 cm), and 


t See ECM, Chapter VI, for the phenomenological electrodynamics of superconductors. 

+ This will be assumed throughout the rest of Chapter V, and j will therefore everywhere 
denote the superconductivity current density. 

$ The derivation of (44.8) given here is due to L. D. Landau (1941). 
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so the field actually penetrates only into a thin surface layer. The distance 6 is 
called the London penetration depth of the field. We must emphasize that it is 
directly measurable and has an entirely definite meaning, unlike the convention- 
al significance of the parameter n, 

The above derivation requires an important proviso, however. The original 
formula (44.7) is valid only if all quantities vary sufficiently slowly in space: the 
characteristic distances over which they vary considerably must be large com- 
pared with the coherence length £o. In the present case, this means that we must 


have 
ô >> Èo (44.14) 


This requirement, of course, does not affect the proof that the field is dis- 
placed from the superconductor: to suppose that the field is not displaced would 
lead to a logical contradiction, since it would then certainly vary slowly and 
equation (44.11) would be valid, but the specific equation (44.11) and the result- 
ing law of field decay (44.13) are valid only if (44.14) is satisfied. 

A superconductor in which the inequality 6 >> &o is satisfied is called a London 
superconductor; the opposite case, with 6 << &o, is called the Pippard case (the 
field decay in the superconductor is then of the kind to be discussed in §52). As 
T > T,„ the superconducting electron density n, — 0, so that 6 — oo. Thus we 
always have the London case sufficiently close to the transition point. As T — 0, 
however, the relation between ô and & depends on the specific properties of 
the metal.* 

Lastly, let us consider a further consequence of equation (44.7) that is inde- 
pendent of the relation between 6 and o. As we know from the macroscopic 
electrodynamics of superconductors, if there is a magnetic flux linking a super- 
conducting torus, it remains constant regardless of any changes in state of the 
body (if these do not destroy its superconductivity). Here we assume that the 
torus has a diameter and thickness large compared with the coherence length 
and the field penetration depth. We shall show that the magnetic flux “frozen” 
in the aperture of the torus can only be an integral multiple of a certain “flux 
quantum” (F. London 1954). 

Within the body (beyond the range of penetration of the field) the current 
density j = 0; the vector potential, however, is not zero, but only its curl, i.e. 
the magnetic induction B. We take any closed contour C embracing the aper- 
ture of the torus and passing through the torus far from the surface, so that the 
condition for equation (44.7) is satisfied, namely the slowness of the spatial 
variation of the phase ® and the potential A. The circulation of the vector A 


t The induction B itself is the true microscopic strength of the magnetic field, averaged over 
physically infinitesimal volume elements that are large only in comparison with the lattice 
constant. 

? The London case occurs at all temperatures, for example, in pure transition metals and 
in some intermetallic compounds. The Pippard case occurs (far from 7,) in pure non-transi- 
tion metals. 
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along the contour C is equal to the flux of the magnetic induction through a sur- 
face spanning the contour, i.e. the flux ¢ through the aperture of the torus: 


$A.dl = [curlA.df = [B.df = ¢. 


On the other hand, equating (44.7) to zero and integrating it along the contour, 
we have 


hic fic 
faa = z PTP. = 5, 


where ô® is the change of phase of the wave function on passing round the 
contour. Since this function must be one-valued, it follows that the phase change 
can only be an integral multiple of 2x. Thus we have the result 


p = no bo = ahc/|e| = 210-7 G.cm?, (44.15) 


where n is an integer. The quantity ġo is the quantum of magnetic flux. 

The quantization of the magnetic flux has another aspect: it causes the values 
of the total current J that can flow along a superconducting ring (in the absence 
of an external magnetic field) to be discrete. This current J creates a magnetic 
flux through the ring equal to LJ/c, where L is the self-inductance. Equating 
this to no, we find as the possible values of the current 


J = chon/L = nňc?n/| e| L. (44.16) 


In contrast to the magnetic flux quantum, the “quantum of total current”, like 
the self-inductance L, depends on the shape and size of the ring. 


PROBLEM 
Determine the magnetic moment of a superconducting sphere with radius R « 6 in a mag- 
netic field, in the London case. 


SoLuTION. When R « 6, the magnetic field within the sphere may be regarded as constant 
and equal to the external field 9. If the vector potential is taken in the form A = 19 xXr, we 


can put simply esas 
= —(n,e?/mc) A, 


i.e. take ® = 0 in (44.7): the boundary condition for the normal component of the current to 
be zero on the surface of the sphere (n.j = 0) is then automatically satisfied. The magnetic 
moment is calculated as the integral 


1 
M = -z | txiav 


over the volume of the sphere, and is 
= — R® §/306*. 
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§ 45. The Ginzburg-Landau equations 


The complete theory of the behaviour of a superconductor in a magnetic 
field is very complex. However, the position is considerably simpler in the 
temperature range near the transition point. Here it is possible to set up a sys- 
tem of relatively simple equations, valid in both weak and strong fields.* 

In the general Landau theory of phase transitions of the second kind, the 
difference between the “unsymmetrical” and the “symmetrical” phase is de- 
scribed by the order parameter, which is zero at the transition point (see Part 1, 
§142). For a superconducting phase, the natural order parameter is the conden- 
sate wave function £. To avoid complications that are in principle unnecessary, 
we shall assume that the metal crystal has cubic symmetry; as already men- 
tioned in §44, the superconducting state is then characterized by a scalar quan- 
tity n,, the superconducting electron density. A more convenient choice as the 
order parameter in this case is a quantity y that is proportional to Æ but is 
normalized by the condition |? = 4n, The phase of is the same as that 
of 2: 


y = y (ens) el. (45.1) 


The superconductivity current density (44.2), expressed in terms of y, is 


ieħ * * 
© = ~ 5 (p*vp— yyy"). (45.2) 


£ 
r 


The starting-point of the theory is the expression for the free energy of the 
superconductor as a functional of y(r). In accordance with the general ideas of 
the Landau theory, this is found by expanding the free energy density in powers 
of the small (near the transition point) order parameter p and its derivatives 
with respect to the coordinates. As a first step, let us consider a superconductor 
in the absence of a magnetic field. 

In accordance with its significance as a quantity proportional to the Green’s 
function F(X, X) = —i&(X), the order parameter y is not unique: since F(X, X) 
is constructed from two operators Y, an arbitrary change of phase of these 
operators, Y — Pe? causes a change of phase of the function F by «. Phys- 
ical quantities, of course, must not be affected by this arbitrariness, i.e. must 
be invariant under a transformation of the complex order parameter y — pe”. 
This excludes odd powers of y in the expansion of the free energy. 

The specific form of this expansion is established by the same considerations 
as in the general theory of phase transitions of the second kind (see Part 1, 
§146). Without repeating the arguments, we can write down the following ex- 


t The theory given below is due to V. L. Ginzburg and L. D. Landau (1950). It is note- 
worthy that this theory was constructed phenomenologically, before the microscopic theory 
of superconductivity. 
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pansion of the total free energy of a superconducting body:* 
h2 1 
= ELON 2 CS as 4 A 45.3 
F r+ (a (vp ž+alyi+yb! yit ay (45.3) 


Here F, is the free energy in the normal state (i.e. for y = 0); b is a positive 
coefficient depending only on the density of the substance (not on the temper- 
ature); a is a function of the temperature given by 


a =a(T-T), (45.4) 


and is zero at the transition point; the coefficient « > 0, in accordance with the 
fact that the superconducting phase corresponds to the range T < T,; the 
coefficient of | Vy |? in (45.3) is chosen so that the expression (45.2) is obtained 
for the current (see below). The fact that (45.3) contains only the first deriva- 
tives of y is the result of assuming sufficient slowness of spatial variation of y. 
In a homogeneous superconductor, with no external field, the parameter y is 
independent of the coordinates. Then the expression (45.3) reduces to 


F = F,+aV |p|?+ibV |p|. (45.5) 


The equilibrium value of | y/? (for T < T,) is determined by the condition for 
this expression to be a minimum: 


lp |? = —a/b = &(T.—T)b; (45.6) 


the superconducting electron density, as a function of temperature, decreases 
linearly to zero at the transition point. 

Substituting (45.6) back into (45.5), we find the difference in the free energies 
of the superconducting and normal states: 


F,—F,, = —V(a?/2b)(T.-—T)*. (45.7) 


From this, by differentiating with respect to the temperature, we can find the 
difference in the entropies, and then the discontinuity in the specific heat at the 


transition point :§ 
C.-C, = VoeTJb. (45.8) 


t We shall only mention again that this form of the gradient term depends on the assump- 
tion that the crystal has cubic symmetry. With lower symmetry, it would have a more general 
quadratic dependence on the derivatives 0y/0x,. 

+ This choice (including the identification of m with the actual mass of the electron) has, 
of course, no deep significance, and is conventional to the same extent as the definition of n,. 
in (44.2). 

$ Comparison of (45.6) and (45.8) for |y |? = 0,/2m and for the discontinuity in the specific 
heat with (40.16) and (40.11) for the same quantities in the BCS model gives the values of the 
coefficients « and b in that model (L. P. Gor’kov 1959): 


& = 672°7,/70(3) u = 7.047,/u, b = aT,/n; 
here we have used the relation between the particle number density n = @/m, the chemical 
potential u (at T = 0) and the limiting momentum for an ideal gas: 
n = p}/30°h®, u = ph/2m. 
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Near the transition point, the difference (45.7) is a small addition to the free 
energy. According to the theorem of small increments (Part 1, §15), the same 
quantity, expressed as a function of temperature and pressure, instead of 
temperature and volume, gives the difference in the thermodynamic potentials, 
®,—®,. On the other hand, according to a general formula in the thermody- 
namics of superconductors (see ECM, (43.7)), this difference is — V H?/82, where 
H, is the critical field which destroys the superconductivity. Thus we find for 
this field the following temperature dependence near the transition point :* 


H, = (4na? b)? = (4rca?/b)¥2(T.—T). (45.9) 


When a magnetic field is present, the expression (45.3) for the free energy 
has to be modified in two ways. Firstly, the magnetic field energy density 
B?/87 (where B = curl A is the magnetic induction in the body) has to be added 
to the integrand. Secondly, the gradient term has to be changed so as to satisfy 
the requirement of gauge invariance. In the previous section it has been shown 
that this condition makes it necessary to replace the gradient V® of the conden- 
sate wave function phase by VØ —2eA /ñc. In the present case, this means making 
the substitution 


vy =e? J | y| +iy 7D > Vy—(2ie/fic) Ay. 


Thus we have the following basic equation: 


2 2 r, 2 
F= Foot | Ret ape : (v ~Fe4) P| talpterolyilay, (45.10) 
where F is the free energy of the body in the normal state in the absence of 
the magnetic field. It must be emphasized that the coefficient 2ie/fc in this 
expression is not arbitrary (in contrast to the above-mentioned conventional 
choice of the coefficient 42/4m). The doubling of the electron charge is due to the 
Cooper effect (L. P. Gor’kov 1959); this coefficient could not, of course, be 
found by purely phenomenological means. 

The differential equations which determine the distribution of the wave 
function y and the magnetic field in a superconductor are now found by mini- 
mizing the free energy as a functional of the three independent functions y, 
y* and A. 

The complex quantity y is a set of two real quantities, so that y and y* must 
be regarded as independent functions in the variation. Varying the integral 


t In the BCS model, 
H, = 2.44 (mp,/h®)"*(1,-T) as T > T,. 


In the same model at T = 0, 
H, = 0.997, (mpp/h®)¥?, 


as is found by equating — VH?2/8z to the energy difference (40.9). 
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with respect to y“ and integrating by parts in the integral of the term (Vy— 
2ieA/hic) dy", we find 


w h2 2ie ,\? r ‘ 
orm ff- ar (9-404) ptap+b|y| pla dv 


h2 2ie Eies 
+ (vv -5 Ay) ôy* df; (45.11) 
the second integral is taken over the surface of the body. Putting ôF = 0, 
we obtain as the condition for the volume integral to be zero for arbitrary dy" 


z ihv- A ; +ap+b\yp|?y = 0; (45.12) 
varying the integral with respect to y gives the complex conjugate equation, 
and therefore nothing new. 

Similarly, varying the integral with respect to A gives Maxwell’s equation 


curl B = (4x/c)j, (45.13) 


and the current density is 
ee eee 
j=- zp C YV) -i ere (45.14) 


which agrees with (44.7); we have written j for j,, since in thermodynamic equi- 
librium there is no normal current. From (45.13) we have the equation of conti- 
nuity div j = 0, which may also be obtained,by direct differentiation of (45.14), 
using (45.12). 

Equations (45.12)-(45.14) form the complete set of Ginzburg-Landau equa- 
tions. 

The boundary conditions on these equations are found from the condition 
that the surface integrals in the variation ôF are zero. Thus we get from (45.11) 
the boundary condition 


n.(—it vy Ay) = 0, (45.15) 


where n is the normal vector at the surface of the body. As a result of this con- 
dition, the normal component of the current (45.14) is also zero, as it should be: 
n.j = 0.1 


t With the boundary condition (45.15), y itself is not zero, as the wave function apparently 
ought to be at the boundary of the body. This is because y actually falls to zero only at dis- 
tances ~ $ from the surface, but such distances are regarded as negligible in the Ginzburg- 
Landau theory. 

The condition (45.15) has been derived here essentially for a superconductor-vacuum 
boundary. It remains valid for a boundary with an insulator, but it is not correct for an inter- 


182 Superconductivity 


The boundary conditions for the field are as follows. From equation (45.13), 
since j is finite in all space (up to the surface of the body), the tangential com- 
ponent B, of the induction is continuous. The equation div B = 0 shows that 
the normal component B, of the induction is continuous. Thus the boundary 
conditions require the continuity of the whole vector B. 

In a weak magnetic field, we can neglect the influence of the field on |y |2, and 
take the latter to have the value (45.6) at all points in the body. Then the substi- 
tution of (45.14) in (45.13), followed by taking the curl of both sides, gives the 
London equation (44.11), with penetration depth 


mc2b 712 mcb 1/2 
ja Ps] sl aar | i (30) 


The Ginzburg-Landau equations contain another characteristic length be- 
sides this: the correlation radius of the fluctuations of the order parameter p 
(in the absence of the field), which we denote by £(7). From the formulae of 
fluctuation theory (see Part 1, §146), this radius is expressed in terms of the 
coefficients in the free energy (45.3) by 


EIT) = h/2(m | a|)*? 
= h/2(ma)/2(T,-T)"?. (45.17) 


The characteristic lengths (45.16) and (45.17) determine the order of magni- 
tude of the distances over which there is a significant change in the order param- 
eter y and the magnetic field, as described by the Ginzburg-Landau equations. 
The length ô is in general characteristic of the magnetic field, and £(7) of the 
distribution of y. Both these lengths must be large in comparison with the 
“dimensions of the pair” £o, in order to satisfy the assumption that all quantities 
vary sufficiently slowly in space. Since both lengths increase as the transition 
point is approached (in proportion to(T,—T)~*”), this condition is in general 
satisfied near the transition point (see below). 

In the theory given here, the Ginzburg-Landau parameter is important; it is 
defined as the constant (temperature-independent) ratio of the two lengths: 


x = A(T)/E(T) = mcb¥2/(2n)? | e| h. (45.18) 


face between different metals (one superconducting, the other normal), since it does not take 
into account the partial penetration of superconducting electrons into the normal metal. In 
this case, (45.15) is replaced by a more general condition compatible with n.j = 0: 


i (-i vy- Ay) Si. (45.15a) 


where A is a real constant (with the dimensions of length); however, an estimate of this con- 
stant would need a more detailed microscopic investigation. 
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In order of magnitude, x ~ 80/fo, where £o is the coherence length (39.21), and 
do is the London penetration depth at absolute zero. There is also a formula 


x = 24/2] e |/ħe) HAT) 54), (45.19) 


obtained from (45.9) and (45.16), expressing x directly in terms of observable 
quantities. 

Having established the form of the equations, let us now consider their range 
of applicability. 

At low temperatures, this range is in any case limited by the condition 
T,—T «T,, enabling the order parameter to be regarded as small, which is 
thus fundamental to the expansion that has been obtained for the free energy. 
The same condition ensures that (T) >> čo, but it is not strong enough to ensure 
that 6(7) >> o in superconductors for which the parameter x is small;‘ in such 
cases, the inequality 6 >> gives the condition 


T.—T «xT. (45.20) 


As T — T, the validity of the equations is limited only by the general con- 
dition for the validity of the Landau theory of phase transitions, relating to 
the occurrence of fluctuations in the order parameter. In the present case, 
however, this condition is extremely weak: it is expressed in terms of the coeffi- 
cients in the expansion (45.3) by the inequality 


T.—T >> b°T2/a(h?/m)> 


(see Part 1, (146.15)). For instance, an estimate of the expression on the right, 
using the values of b and « in the BCS model, gives 


(Te — T/T.: >> (Te 0}. (45.21) 


Since the ratio T„/u ~ 1073-1074 is very small, we can regard this condition 
as satisfied almost up to the transition point itself. The fluctuation region for 
the transition of the second kind between the superconducting and normal 
phases practically disappears. 


PROBLEM 


Find the critical magnetic field (parallel to the film plane) which destroys the superconduc- 
tivity for a plane film with thickness d « &, ô (V. L. Ginzburg and L. D. Landau 1950).? 


SOLUTION. We take the median plane of the film as the xz-plane, with the x-axis in the di- 
rection of the field. In equation (45.13) for the field B = B,(y) (which varies along the y-axis 
normal to the film), we can take y = constant. Then the first term in the expression (45.14) for 


t As examples, the values of x for some pure metals are: aluminium 0.01, tin 0.13, mercury: 


0.16, lead 0.23. 
+ See §47 for the corresponding problem of a small sphere. 
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the current is zero, and taking the curl of (45.13) gives B” = 67B/65?, where 0 =¢/Po, yi=lal/b. 
The solution of this equation, symmetrical in y, is 


_ « cosh (y0/ô) y- Gd 
BY) = 9 oh (40/20) ~ 6 [1+ 38" -e], 


where § is the external field. This corresponds to the current distribution 
j = j, =—cB [4a = —c¥ Oy/4n6?, 


In equation (45.12), however, we cannot completely neglect the dependence of y on y: the 
small derivative d*y/dy? is here multiplied by A*/m|a| ~ $?, and thus acquires the large (from 
the condition d « £) coefficient (€/d)*. We can neglect the potential A = A,(y) in this equation, 
which here leads to terms of a higher order of smallness in d/f. In order to avoid the need to 
consider the dependence of y on y, we average equation (45.12) over the film thickness; the 
derivatives with respect to y then disappear, because of the boundary condition dy/dy = Oat 
the surface of the film. Noting also that 

OP soit) 
eat * (Terit á 
because of the z-dependence of the phase of y (and the relation between its gradient and the 
current), we find, after cancelling y, 


Oom n 
lyr lal+b\y|? =0, 
where 
ale "PHS? 
ee agya CAID 
=q ll P dy =F ET 


—d]2 


Using also (45.9) and (45.16), we arrive at the equation 
az | (85) = ,_ lv! 
24 Hô] — yr ’ 


which determines y for a film in a magnetic field. The critical field Hf for the film is that for ` 
which y = 0. It is related to the critical field H, for a bulk superconductor by 


H?! = 4/(24) H,ôld. 





In the conditions considered, the removal of superconductivity by the field takes place 
through a phase transition of the second kind: y tends to zero continuously as © increases. 
This is entirely reasonable, since for d « 6 the field actually penetrates into the superconduct- 
ing film and there is no cause for a transition of the first kind, which would consist in a sudden 
penetration of the field into the body. 


§ 46. Surface tension at the boundary of superconducting and normal phases 


The Ginzburg-Landau equations allow, in particular, the calculation of the 
surface tension at the boundary of superconducting (s) and normal (n) phases 
(in the same sample) in terms of bulk characteristics of the material (V. L. 
Ginzburg and L. D. Landau 1950). Such boundaries exist in metallic samples 
that are in the “intermediate” state in a magnetic field. Since the only difference 
between the two phases is that yis zero in one but not in the other, the transition 
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between them is continuous over a certain layer and is described by the Ginz- 
burg-Landau equations with boundary conditions established only at large 
distances on either side of this layer. 

Let us considet a plane interface between n and s phases in a metal, taking 
the interface as the yz-plane and the x-axis into the s phase; the distribution of 
all quantities in both phases depends only on the coordinate x. The vector 
potential of the field, the choice of which is not yet uniquely specified, will 
now be subjected to the gauge in which div A = 0; in the present problem this 
gives dA,/dx = 0, whence we see that it is possible to take A, = 0. It is evident 
from symmetry that the vector A is everywhere in one plane; let this be the 
xy-plane, so that A, = A; then the induction vector is in the xz-plane, with 


B=B, =A’ (46.1) 


(the prime denoting differentiation with respect to x). 
Next, we rewrite (45.13) in the form usual in macroscopic electrodynamics, 
curl H = 0, with the field strength H given byt 


H = B—4xM,_  ccurlM =j. 


From this equation, it follows in the present case that H = constant. Far from 
the interface, within the normal phase, the induction and the field are the same 
and equal to the critical field: B = H = H, (we neglect the magnetic suscepti- 
bility of the normal phase). Hence H = H, = H, in all space. 

Neglecting the change in the density of the material in the superconducting 
phase transition, we shall regard the density (and the temperature) as constant 
throughout the body.* Let f denote the free energy per unit volume (in contrast 
to F, the free energy of the whole body). At constant temperature and density, 
and with surface effects neglected, the differential df is 


df = H.dB/4n; (46.2) 


cf. ECM, §30. Hence we see that the additional requirement of constant B 
would lead in these conditions to constancy of 


f = f-H.B/4n. (46.3) 


Thus the whole contribution to the integral F = f JaV from the variable part 
of F is due only to the presence of the interface. Taking this contribution per 


t To avoid misunderstanding, we may mention that the comment in ECM §41 about the 
unsuitability of using H referred to the electrodynamics of superconductors, where the range 
of penetration of the magnetic field was regarded as infinitesimally short. The Ginzburg~ 
Landau equations, however, are applied to the structure of precisely this region. 

* Strictly speaking, in phase equilibrium the chemical potential, not the density, is constant 
throughout the system. When taking account of the change in the density it would therefore 
be necessary to consider the thermodynamic potential 2, not the free energy. 
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unit area of the interface, we can therefore calculate the surface tension coeff- 
cient as the integral 


tins = f (F-J) dx, (46.4) 


where the constant f, is the value of f far from the interface, for example within 
the normal phase. 
For the normal phase, the free energy f, = f+ B?/82 = fo + H2/82, so that 
Í n = fa — He/4n 
= fno — Hèl 8n 
= Jı no ~~ a?| 2b ’ 


where the last expression is found by means of (45.9). The quantity f at any 
point is expressed in terms of the free energy density f by 


f = f—H,B/4a. 


Now, using (45.10), we reach the following formula for the surface tension: 


oo 


B? h2 4e? 1 H.B YS a? 
Ons = {| (iv't+ rA*ipP+aly lt) +7 iyli- iz +35 |e 








ar Am Rc 


(46.5) 


The integrand vanishes, as it should, both within the normal phase (x + — æ), 
where y = 0 and B = H., and within the superconducting phase (x + œ), 
where | y |? = —a/b, B = 0. 

It should be noted that, in the integrand of (46.5), the term iA.vy does not 
appear, since A, = 0. The corresponding term also does not appear in (45.12), 
and so the equation remaining has real coefficients; its solution may therefore 
be taken real, as will be assumed below. The first term in the current density 
(45.14) disappears, leaving 


j = —(2e2/mc) pA. (46.6) 


We shall use instead of the variable x and the functions A(x) and p(x) the 
dimensionless quantities 


Z =x/ô, $= yy(b/|a|), A=A/H.5, B= dA/dz = B/H.. (46.7) 


In the rest of this section, only these quantities will be used, and the bars over 
the letters will be omitted, for brevity. In these variables, equation (45.12) 
becomes 


y” = 2[( Ae 1) pty] (46.8) 
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Equation (45.13) with j from (46.6) is 
A” = Ay. (46.9) 


The boundary conditions on these equations in the problem considered 
(corresponding to the n and s phases as x --— œ and x + oo) are 


=0, B=A'=1 at = — æ, 
7 E | (46.10) 
y=1, A =0 at x =o. 
It is easily verified that equations (46.8), (46.9) have the first integral 
(2/x2) p'2-+ (2— A?) p?— yt + A’? = constant = 1, (46.11) 


the value of the constant being determined from the boundary conditions.* 
Lastly, the expression (46.5) becomes 


oo 





ôH? 2 
Ge = -p | |r v+- yh yt (A= e| dx 
_ ôH? j OF tan Sia 
aa | ae +a(a'—0)| dx, (46.12) 


— 00 


with the second equation obtained by taking yt from (46.11). 

Let us now examine the above equations, and take first the case x < 1 (which 
usually occurs in superconducting pure metals). This inequality signifies that 
6(T) < &(T), i.e. the magnetic field varies considerably over a distance small 
compared with the characteristic distance of variation of the function y(x). 

Figure 6 shows diagrammatically the distribution pattern for the field and y 
in this case. Where the field is large we have y ~ 0; then the field falls abruptly 
and y(x) begins to change slowly (over distances ~ 1/x) in the absence of the 
field. Putting A = 0 in (46.11), we find the equation 


v= zlo) 





t From the conditions (46.10) it necessarily follows that y’ = 0 at x = +o, and from the 
same conditions and (46.9), that A” = O and A = Oat x = oo; the definite value of A(co) is 
the result of taking y real. 
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which is to be solved with the condition y = 0 at x = 0, taken somewhere in 
the region of decreasing field. This solution is 


y = tanh xx//2, | (46.13) 


and a calculation of the integral (46.12) with this function (and A = 0) gives 


Hô H 1.96 


a a x (1919) 


ns = 

The error in this value is due to neglecting here the contribution to the inte- 
gral from the region where the field decreases. To estimate the width 6; of this 
region,’ we note, first, that 1/62 ~ y? from (46.9); second, that formula 
(46.13) must remain valid in order of magnitude even at the boundary of the 
region x ~ 6;, whence y ~ xô. From these two relations we find 61 ~ 1/+/. 
The contribution to the surface tension from this region is ~ H?6/+/x, i.e. 
is small in comparison with (46.14) only in the ratio ~ +/x (so that the accuracy 
of (46.14) is fairly low). 

When the parameter x increases, the surface tension coefficient passes 
through zero and becomes negative. This is evident from the fact that the 
inequality «,,< 0 is always satisfied for sufficiently large x: the characteristic 
distances of variation of y(x) in this problem cannot be less than those for A(x), 
since any change in A causes a change in y; hence, for large x, the term y’2/x? 
in the integrand in (46.12) may be neglected, and the integrand is negative since 
0< A’ < 1 (i.e.0 < B < H, in ordinary units). We shall show that «,, is zero at 





To do So, we rewrite the expression for «,, as 
Ons = si | [(A’ — 1)?— p4] dx, (46.16) 


which is obtained from the first integral (46.12) by integrating the term y’? by 
parts and then substituting y” from (46.8). The integral is certainly zero if the 
integrand is identically zero, i.e. if 


E Sees: (46.17) 


the opposite sign cannot occur, since the field B = A’ must decrease with 
increasing x. Eliminating y from (46.17) and (46.9), we find 


A” = A-A’), (46.18) 


t We must emphasize that ô, is not the same as the field penetration depth in a super- 
conductor adjoining a vacuum. In the latter case, y~ 1 in the field penetration region, where- 
as in penetration from the n phase the field decreases in a region where y is small. 


§ 46 Surface Tension at Boundary of Phases 189 


the solution of which (with the boundary conditions A’ = 1 at x = — œ and 
A = Oat x = œ) determines the field distribution; by virtue of (46.17), the 
boundary conditions (46.10) for y are then satisfied automatically. We need 
not actually solve (46.18), but simply verify that for x? = 4 the equation (46.8), 
which has not yet been used, or equivalently its first integral (46.11), must 
necessarily be satisfied. Substituting (46.17) in (46.9), we obtain y' = —}Ay; 
this value of y’, with A’ from (46.17), in fact satisfies equation (46.11) identically 
with x? = +, 


PROBLEM 


For a superconductor with x « 1, find the first field correction to the penetration depth in 
weak fields. 


SOLUTION. We take the surface of the superconductor as the yz-plane, with the z-axis in the 
direction of the external field, and the x-axis into the body. The distribution of the field and 
y in the superconductor is given by equations (46.8), (46.9), which are to be solved with the 
boundary conditions 

y=0, B=A’= at x=0, 


y =1, A=0 at x =0o; 
the first of these is (45.15). We seek the solution in the form 
y= 1+9,(x), 4,=—De-*+4i@), 


where p, and A, are small corrections to the solution atx = 0, which corresponds to the Lon- 
don decay of the field (44.13). The correction y, is given by the equation 


VY = yit e-s, 


whence, with the boundary conditions, 





p= eet egos ng eV (1) 
For A, we can now write the equation 
Ay = A,—29e-*1, 
and substitute for y, only the second term in (1), which is of the first order in x. Using the 


boundary condition (4; = 0 when x =0) and neglecting where possible the higher-order 
terms with respect to x in the coefficients, we find 


Ay =—3 9°04 2x) e-z- etv], (2) 
This gives the corrections to the field decay within the superconductor. The effective pene- 
tration depth 6,4; is, by definition, such that 


Gen = | B(x) dx = —- A0) = §-A,(0). 
0 


Returning to ordinary units, we find from (2) 


tao [iaia (By 
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§ 47. The two types of superconductor 


The sign of the surface tension «x, has a considerable influence on the prope 
erties of superconductors. This forms the basis for the division of all super- 
conductors into two classes: those of the first kind, with «,, > 0, and those of 
the second kind, with «,, < 0. Since the sign of «,, is governed by the value of 
the Ginzburg-Landau parameter x, values x < 1/4/2 correspond (near T,) to 
the former and x > 1/4/2 to the latter." 

Let us consider a solid cylindrical superconductor in a longitudinal external 
magnetic field H. If the superconductor is of the first kind, it undergoes a phase 
transition of the first kind as the field increases to a critical value H,. The role 
of the surface tension is then (as in any phase transition of the first kind) to 
impede the formation of the first nuclei of the new phase and thus make pos- 
sible a metastable continuance of the s phase at fields somewhat above H.. 

If the superconductor is of the second kind, however, the occurrence of 
“inclusions” of the n phase may be thermodynamically favourable even before 
His reached; the increase of volume energy is compensated by the negative 
surface energy of such a nucleus. The lower limit of fields for which this is 
possible is usually denoted by H,, and called the lower critical field. Similarly, 
starting from a metal in the normal state in a high external field, we reach a 
value H > H, the upper critical field, below which the occurrence of “in- 
clusions” of the s phase is thermodynamically favourable, again because of the 
advantage from the negative energy of the boundaries. Thus, over a certain 
range of fields, Ha < H < H,» the superconductor is in a mixed state.* 
Its properties in this state gradually change from purely superconducting at 





Fic. 7. 


t The first kind includes superconducting pure metallic elements; the second kind includes 
superconducting alloys. The hypothesis that x > 1/+/2 in alloys was first put forward by L. D. 
Landau. 

* Not to be confused with the intermediate state of a superconductor of the first kind, 
which results from certain configurations of the sample in the external magnetic field. 
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H to purely normal at H,,; at the same time, the magnetic field gradually 
penetrates into it. The value of H, determined only by the relation between 
the bulk energies of the n and s phases has no special significance in this case. 

The two critical fields depend, of course, on the temperature, and become 
zero at T = T,. This gives a phase diagram as shown in Fig. 7 for super- 
conductors of the second kind. The broken curve in Fig. 7 is explained below. 

The upper critical field can be determined (in the Ginzburg-Landau theory) 
even without previously ascertaining the structure of the mixed state. We need 
only observe that in fields somewhat below H,. a nucleus of the s phase must 
have a small value of the order parameter y; it is evident that y — 0 when 9 
tends to H». Hence the state of these nuclei can be described by the Ginzburg- 
Landau equations linearized with respect to y. Omitting the non-linear term in 
(45.12), we obtain the equation 

1 


dae i 
am (eA) w= tale, (47.1) 


where A is to be taken as the vector potential of the uniform field Q at y = 0, 
when the body is in the normal state and the external field penetrates it com- 
pletely. 

But (47.1) is in form just the Schrédinger’s equation for a particle of mass 
2m and charge 2e in a magnetic field, with | a| as the energy level. The boundary 
conditions also agree: y = 0 at infinity. It is known (see QM, §112) that the 
minimum energy of a particle moving ina uniform magnetic field is Eo = fhwy, 
where wy = 2'e'/2mc; this is the energy value at which the continuous energy 
spectrum begins. The analogy between the two problems therefore shows that 
the s-phase nuclei described by (47.1) can exist only if 


[a| > Je|hD/2mc, 


so that the critical field H = 2mc|a|/|e|#. By means of (45.9), (45.17) and 
(45.18) this formula may be written 


He = 2H, (47.2) 


(A. A. Abrikosov 1952). 

The solution of equation (47.1), with the boundary condition y = 0 at 
infinity, corresponds to the formation of an s-phase nucleus within the sample, 
far from its surface. We shall show that the presence of the surface favours 
nucleation, and that nuclei may thus be formed in a thin surface layer even if 
§ > H (P. G. de Gennes and D. Saint-James 1963). . 

The solution of equation (47.1), describing an s-phase nucleus near the sur- 
face of the body (assumed plane), must satisfy on the surface the boundary 
condition Oy/Ox = 0, where x is the coordinate along the normal to the surface 
(the condition (45.15) with A, = 0). To establish the required quantum- 
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mechanical analogy, we recall that the above-mentioned problem of the motion 
of a particle in a uniform magnetic field is in turn equivalent to that of motion 
in a one-dimensional parabolic potential well 


U = $ 2mo%(x —x0)?, 


where Xo is a constant corresponding to the “centre of the orbit” (see OM, 
§112). Let us now consider a double well consisting of two equal parabolic 
wells lying symmetrically relative to the plane x = 0 (Fig. 8). The ground 
state of a particle in such a field corresponds to a wave function y(x) that is 


Xx) 


-Xo Xo x 
Fic. 8. 


even in x and has no zeros; such a function automatically satisfies the condition 
y’ = Oat x = 0. The ground level of a particle in the double well is, however, 
below that in the single well; when applied to the nucleation problem, this 
proves the above assertion about their easier formation near the surface. 

Numerical calculation of the level in a double well gives the result that its 
minimum value (as a function of the parameter xo) is 0.59Eo. Repeating the 
arguments that led to (47.2), we find that the upper limit of fields in which sur- 
face nuclei of the s phase occur is H, = H,2/0.59, i.e. 


Ha = 1.7H eq = 2.4xH.. (47.3) 


Thus, in the range of fields between H, and H,, there arises the phenomenon 
of surface superconductivity; the boundary of this region is shown by the 
broken curve in Fig. 7. The thickness of the superconducting layer at the surface 
of the normal phase is of the order of &(T’). This estimate is easily derived from 
the same quantum-mechanical analogy: the wave function of a particle in a 
potential well (at the level Eo) is concentrated in the region x ~f/+/(mEo). The 
corresponding dimension of the nucleus is obtained by replacing Eo by lal 
and is, according to (45.17), &(7). 

The above discussion relates entirely to superconductors of the second kind, 
but the critical fields H, and H,, defined here may have a certain physical 
significance for those of the first kind also. 

If x is in the range 1/4/2 = 0.71 >x > 0.59/4/2 = 0.42, then H < H, but 
H., > H. Although no mixed phase occurs in this case, there is surface super- 
conductivity in the range of fields between H, and H3. 


t This is because the potential energy in the half-space x < 0 is less than it would be for 
a single well shown by the broken curve in Fig. 8; see, for example, QM, §50, Problem 3. 
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Finally, in the sense of the above derivation, the value of H,. (47.2) deter- 
mines (for any x) the upper limit of fields in which the formation of s-phase 
nuclei with arbitrarily small y is possible. Hence, in a superconductor of the 
first kind (where H < H,) in fields H < H,., the thermodynamically un- 
favourable normal phase is absolutely unstable; but in the range Ha < 9 < 
< H, the normal phase can exist as a metastable phase. In this range, the phase 
transition of the first kind from the n to the s phase can only occur by the for- 
mation of s-phase nuclei with finite values of y, which is opposed by the posi- 
tive surface tension at their boundaries (V. L. Ginzburg 1956). 


PROBLEM 


Determine the critical field for a superconducting sphere of small radius R « ô (V. L. 
Ginzburg 1958). 


SoLuTION. In this case (as in a thin film; see §45, Problem) the superconductivity is de- 
stroyed by a phase transition of the second kind. The critical field for a sphere may be found 
as that below which the n phase ceases to be stable with respect to the formation of s-phase 
nuclei. As in the text, this amounts to finding the lowest eigenvalue of the Schrdédinger’s equa- 
tion (47.1). With the condition R « 6, this may be sought by means of perturbation theory 
with respect to the external field, the unperturbed wave function being y=constant (the nuc- 
leus occupies the whole volume of the sphere). The eigenvalue is then simply the mean value 
of the perturbation operator (2eA/c)*/4m (the mean value of the operator (/ef/mc) A.V for 
== constant is zero). The vector potential of the uniform field must here be taken as A=49) Xr; 
with this gauge, the solution y = constant satisfies on the surface of the sphere the boundary 
condition (45.15), which reduces ton.A = 0. The result of the averaging is 


e 2 eR 


za E At a ee 
Ey = 4m? 3 pr 10mc2 ° 


The critical field is found, as in the text, from the condition Ey = lal, which gives 
H = 4/20H,6/R. 


The legitimacy of using perturbation theory is confirmed by the fact that the value found 
for E, (at © = H'?4), with the condition R « 4, is in fact small compared with the next eigen- 
value, which would correspond to a wave function varying within the sphere and would be of 
the order of A?/mR’. 


§ 48. The structure of the mixed state 


We shall again consider (as in §47) a cylindrical sample of a superconductor of 
the second kind in a longitudinal magnetic field H, and ascertain the structure 
of the mixed state of the body in fields slightly exceeding the lower critical 
field Hat 

In this case there are nuclei of the normal phase in the main superconducting 
phase. To attain the maximum thermodynamic favourability they must have 
(with negative surface tension) the largest possible surface. The structure ex- 
pected is therefore one in which the n-phase nuclei are filaments parallel to the 


t The results in this section and in the Problems are due to A. A. Abrikosov (1957). 
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field. The magnetic field that penetrates into the body and the annular super- 
conductivity currents surrounding these vortex filaments are concentrated near 
the filaments. 

As the external field approaches H, the number of such filaments in the 
body decreases and the distance between them increases. When this distance 
is sufficiently great, the arguments given at the end of §44 become applicable 
to the individual vortex filaments, whereby the total magnetic flux concentrat- 
ed near a filament must be an integral multiple of the flux quantum ġo = 
= ahic/|e|; we shall see later that filaments with the lowest possible flux, o 
itself, are thermodynamically favourable. The fact that ġo is not zero is what 
sets a limit to the further fragmentation of the n-phase nuclei. 

When the external field, increasing from low values, reaches H4, one vortex 
filament appears in the cylinder. We can write down the thermodynamic con- 
dition that determines this point without for the present investigating the struc- 
ture of the filament itself, but merely using the fact that it is associated with 
some (positive) energy; this energy per unit length of the filament will be denot- 
ed by e (and calculated below). 

It is evident that, in a cylindrical body in a longitudinal external field, the 
induction B also will be everywhere parallel to the axis of the cylinder. The 
same is true of the macroscopic field H = B—4xM defined in §46. The equa- 
tion curl H = 0 then shows that H is constant over the cross-section (and 
therefore throughout the volume) of the cylinder; because of the boundary 
condition that the tangential component of H is continuous, this constant value 
must be equal to the external field: H = &. Thus we have to consider the ther- 
modynamic equilibrium of the body for given volume, temperature, and field 
strength H. The condition for such an equilibrium is for F, the thermodynamic 
potential with respect to these variables, to be a minimum (see ECM, §30). 
Let F, be this potential for a superconducting cylinder; since B = 0 in the 
superconducting phase, F, is the same as the free energy F,. Then the potential 
F for a cylinder with one vortex filament will be 


F = F,+Le— | H.BdV/4x 
= F,+Le—§ f BdV/4z. 


The term Le ts the free energy of the filament (L being the length of the fila- 
ment, which is equal to that of the cylinder), and the last term is the difference 
between the potential F and the free energy F. Since the induction B is entirely 
concentrated near the vortex filament in the body, we have J BdV = Loo, 
where ġo is the flux of the induction through the cross-section of the filament. 
Thus y 
E = F,+Le—Loo/4x. (48.1) 


The occurrence of vortex filaments becomes thermodynamically favourable 
when the quantity added to F, is negative. Equating it to zero, we thus have as 
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the critical value of the external field 
Ha = Arel Po. (48.2) 


Let us now consider the structure of a single vortex filament. We shall take 


only the important case where 
x>l, (48.3) 


i. e. 6 > Ë. The length & determines the order of magnitude of the radius of 
the “core” of the filament, in which | y |? varies from zero (corresponding to the 
normal state on the filament axis) to the finite value corresponding to the main 
s phase; at large distances r from the filament axis, |y|? remains constant.‘ 
The induction B(r) varies much more slowly, decaying only at distances r ~ 
~ ô >> Ë. Thus essentially the whole of the magnetic flux passes through the 
region outside the core, where | y'? = constant (Fig. 9). 


2 


ly] 


| 


Fic. 9. 


The latter fact enables us:to use the London equation (whose validity, let us 
recall, does not depend on the temperature’s being close to T,) in order to find 
the field distribution. To put it in the appropriate form, we first rewrite the rela- 
tion (44.7) between the superconductivity current density and the phase of the 


wave function: 
A+6?curl B = po V®/2z, (48.4) 


using the penetration depth 6 and expressing j in terms of the induction by 
j = c curl B/4z. The London approximation corresponds to the assumption 
that ô is constant. We integrate (48.4) along a closed contour C that embraces 
the filament and passes at distances r>>§ from its axis. Transforming the integral 


t In this section, r will denote a cylindrical coordinate, the distance from the axis. 
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of A by Stokes’s theorem into an integral over a surface spanning the contour 
C, we find 
fB.df+6?¢curlB.dl = ġo, (48.5) 


and a similar transformation of the second integral gives 
f (B+ ô? curl curl B). df = ġo; (48.6) 


on the right we have written the lowest possible (non-zero) value, corresponding 
to a phase increase of only 27. If the contour C passes at distances r >> 6 from 
the filament, where the field and the currents may be regarded as zero, the second 
integral in (48.5) may be omitted, and we see that ġo is equal to the total flux of 
the induction concentrated around the isolated vortex filament. The filament 
axis itself is a line singularity, a passage around which alters the phase of the 
wave function. 

Since the equation (48.6) must be satisfied for any contour C that satisfies the 
conditions stated, it shows that we must have 


B +ô? curl curl B = B— 6? AB = ¢,6(r), (48.7) 


where r is the two-dimensional position vector in the plane of cross-section of 
the vortex filament. Writing the right-hand side of this equation as a delta 
function signifies that distances ~ & are here regarded as zero. In all space ex- 
cept the line r = 0, equation (48.7) is the same as the London equation (44.11), 
but to describe the vortex filament we need a solution with a singularity at 
r=0Q. 

The field distribution at distances r from the axis in the range 6 > r >» can 
be found directly from (48.5). We take as the contour C a circle of radius r in 
this range. The flux of induction through this contour, the first term on the left 
of (48.5), is only a small part of the total magnetic flux, in the ratio ~ (r/d)*; 
it will be neglected. In the second term, dlis an element of length of the circle; 
since the vector B is along the z-axis (in cylindrical polar coordinates with the 
axis along the filament) and depends only on r, we have 


1.(7 XB) = (Ixv).B = —OB,/Or = —dBldr, 


where l is a unit tangent vector to the circle. Thus we obtain the equation 





l.curl B = —dB/dr = ¢,/2nr0?, (48.8) 
whence 
B(r) = bo ioe E «r ò (48.9) 
2762 r? ° i 


Because of the logarithmic dependence, the upper limit of integration (at which 
we should have B ~ 0) may be taken equal to the upper limit of the range of 
distances r under consideration. 
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To continue the distribution found above into the range r 2 6, we use equa- 
tion (48.7), which is valid for all r >> &. Expanding the Laplacian operator in 
cylindrical polar coordinates (with B = B,(r)), we can rewrite the equation 
(forr = Ojas 

B” + B'/r + B/8? = 0. 


The solution of this equation that decreases as r —> co is 
B(r) = constant X K,(r/6), 


where Ko is the Macdonald function (the Hankel function with imaginary argu- 
ment). The constant coefficient is determined by “joining” to the solution 
(48.9), using the known limiting form Ko(z) ~ log (2/zy) for z « 1(y = ef = 
1.78). Thus we have finally 


Br) = oy K,(r/6), ræ ë. (48.10) 

By means of the known asymptotic expression Ko(z) ~ (2/22)'"e-? for z + œ, 

we therefore find, in particular, the law of decrease of the field far from the 

axis of the filament: 
$o 3 

B(r) = (Brr e rla, (48.11) 

Attention may be drawn to the evident analogy between the properties of 

vortex filaments in superconductors and those in liquid helium (§29). In both 

cases, they are line singularities, a passage around which alters the phase of the 

condensate wave function. The circular paths of superfluid motion round 

vortex filaments in liquid helium correspond to circular currents in the super- 

conductor; in the former, the velocity v, of superfluid motion decreases as 1/r, 
and in the latter the superconductivity current density 


ja 5 ‘curl B| = i (48.12) 
decreases in the same way. This agreement is to be expected, since in both cases. 
the relation is a direct consequence of the existence of the line singularity. But, 
whereas in liquid helium this relation v,(r) extends to all distances, in a super- 
_ conductor the decrease of j(r) becomes exponential for r >> ô. The difference is 
due to the charged state of the electron liquid: the motion of charged particles 
creates a magnetic field, which in turn screens the field; if the particle charge e 
is made to tend to zero, the penetration depth 6 — co. 

We can now calculate the free energy of a vortex filament. The contribution 
from the region of space outside the core (r >> &), is given by the integrals 


1 2 
Fa = 5> f Bea + f (curl B)? dV (48.13) 
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taken over this region. For, on varying this expression with respect to B (at a 
given temperature i.e. given 5), we immediately obtain the London equation 
(48.7) (for r =0).t The second integral in (48.13), which diverges logarithmically 
at both ends of the range 6 >> r >> §, is large in comparison with the first. Sub- 
stituting here |curl B' from (48.8), we obtain for the energy per unit length 
of the filament 

e = (o/4n0)? log (6/€). (48.14) 


This expression has logarithmic accuracy, i.e. we assume not only 6/& >> 1 but 
also log (6/&) >> 1; to this accuracy we can neglect the contribution to £ from the 
core of the filament. 

The result (48.14) allows, in particular, the proof of the statement made 
above, that the formation of vortex filaments with the lowest magnitude of the 
magnetic flux is thermodynamically favourable. Since the free energy of the 
filament is proportional to the square of the magnetic flux attached to the fila- 
ment, the energy for a filament with flux nho would contain a further factor 72, 
and the break-up of such filament into z filaments with flux ġo would lead to an 
n-fold gain in energy. 

Substituting (48.14) in (48.2), we find the lower critical field 


Po 


Ha = Fr 


log (6/8). (48.15) 


As T — T, this expression may also be written, by means of (45.19), as* 


log x 


Aa = e 


(48.16) 

As the external field increases, so does the number of vortex filaments and 
therefore the penetration of the magnetic field into the superconductor. When 
the interaction between filaments is taken into account, thermodynamic equilib- 
rium corresponds to a certain ordered configuration of the filaments, forming 
a two-dimensional lattice in the plane of cross-section of the cylinder.’ For any 
number density of filaments, the axis of each remains a line such that passage 
around it alters the phase of the wave function y by 2x. The mean value (over 
the cross-section of the cylinder) of the induction is 


B = voo, (48.17) 


t The second term in (48.13), expressed as a function of the current j, is 
27e?ô? f pav = f 40? dV; 


in the second expression we have also substituted 6? = mc?/4me’n,, and the density and ve- 
locity of the superfluid component according to į = e¢,v,/m; see the first footnote to §44. We 
see that this term can be regarded as the kinetic energy of the superconducting electrons. 

+ Since this formula has been derived on the assumption that log x >> 1, it cannot be used 
when ~ 1. In particular, forx = 1/+/2 the field H, (like H,2) must be simply H.. 

§ The most favourable lattice seems to be that formed by equilateral triangles with vortex 
filaments at their vertices. 
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where » is the number of filaments per unit area of the cross-section. For, if we 
integrate the relation (48.4) along the contour around the whole cross-section 
of the sample, we obtain equation (48.5) with Svoo on the right (S being the 
cross-sectional area); on the left-hand side, the first integral is the total flux SB 
of the induction, and the second represents an edge effect that is small in com- 
parison with the first, in the ratio ~ 6/R, and is therefore negligible (R is the 
linear size of the cross-section); here it is important, of course, that the field 
around the filaments decays at distances ~ 6. 

So long as the distances d between the filaments remain large in comparison 
with the correlation radius &, we can assert that the magnetic fields of the vortex 
filaments are simply additive: when d >> £, we can still draw a contour embrac- 
ing any number of vortex filaments in such a way as to pass everywhere far 
(at distances >> £) from their cores. On such a contour, the condition for the 
London approximation is satisfied (6 is constant), and we therefore again 
arrive at an equation that differs from (48.7) only in that the delta function on 
the right is replaced by a sum of delta functions of the distances from each fila- 
ment; since this equation is linear, the statement made is proved. 

When the external field approaches H», the distances between vortex fila- 
ments become comparable with &. This is clear also from the expression (47.2) 
tor the critical field if it is written, by means of (45.9) and (45.16)-(45.18), in 
fhe form 

H.z = bo/27&?; (48.18) 


it corresponds to a flux ġo concentrated on an area ~ &. 

The disappearance of the superconductivity at § = H, takes place as a phase 
transition of the second kind. In accordance with the general theory of such 
transitions, we can assert that the order parameter y as a function of the exter- 
nal field vanishes as | p |? oc H,.—. On the other hand, the magnetization of the 
substance M = (B— H)/4n, a quantity independent of the choice of the phase 
of y, is itself proportional to |p|? in this range. Since at = H,» we must also 
have B = H, we thus obtain a linear relation between the induction B in a 
superconductor and the external field near the transition point: 


B-Ha © §—He. (48.19) 


PROBLEMS 


PROBLEM 1. Calculate the energy of interaction of two vortex filaments at a distance d >> ¢ 
apart. 


SOLUTION. We transform the expression (48.13) for the free energy of a system of two 
vortex filaments to a form in which the integrations are taken only near each separate filament. 
To do so, we write, using equation (48.7), 

B? + ô?(curl B)? = 6*{—B.curl curl B+ (curl B)?} 
= 6? div (Bxcurl B). 
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The volume integral is transformed to 
Fin, = (67/8n) f Bxcurl B.df, (1) 


Sith, 


taken over the cylindrical surfaces fı and f (of small radius ro: È << rp «< 6) embracing the 
cores of the filaments. When d > é, the filament fields are additive, i.e. B = B,+B,. The 
filament interaction energy is then given by the part of the integral (1) that depends on both 
B, and B,: 

Lez = (67/82) { f BX curl B,.df,+ f B, Xcurl B,.df.}; 


the integrals of the form f B,X< curl B,- df, tend to zero with rọ. Using (48.8) and (48.10), 
we now find 


ô? fen 
E1 = 2 ga o Tar Bd) = ie sana Ko(d/9). 
In particular, at distances d >> 6, 
2 
e12 = apes ldea, 2 


PROBLEM 2. Determine the dependence of the mean (over the cross-section of a cylindrical 
sample) magnetic induction B on the external field in the mixed state when the vortex fila- 
ments are at distances d >> 6 apart, forming (in the cross-section of the sample) a lattice of 
equilateral triangles. 


SOLUTION. The area of an equilateral triangle is / 3đd?/4 (where d is the length of a side), and 
the number of filaments is half the number of triangles in the lattice (N triangles have 3N ver- 
tices, but each vertex in the lattice belongs to six triangles that meet there); hence »=2/+/3d". 

The thermodynamic potential f per unit volume of the body in the mixed state is 


f= p20. 4 Hat+9)+5 PE tk» 


where the second term corresponds to the expression (48.1) with H.ı from (48.2); in the third 
term, £1, is the energy of interaction of two filaments, and the summation is over all filaments 
passing through a unit area. Because of the exponential decrease of £, when d > ô, it is 
sufficient to consider only pairs of neighbouring filaments. In the triangular lattice, each fila- 
ment has six nearest neighbours, so that 


1 1 
> 2 Ei = 65 » En = 3vEz (d). 


Substituting £}: from (2), Problem 1, we find 


$ 0-H. 3h et 
f= hias AaOr al 


where a = d/ô. The dependence of aon § is determined by the condition for the function 
f(@ to be a minimum, which gives 


3 
§- Hi = Fons? Va e~ (3) 


he term of higher order in 1/a« 1 is omitted. This equation, together with B = vp, i.e. 
a = (26 9/+/30°B)"?, 


gives the required dependence B(O). As > Ha, the derivative d/d% tends to infinity accord- 
ing to 
dB 1 


d © Ha $ G- Ha Tr 
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§ 49, Diamagnetic susceptibility above the transition point 


It has been mentioned at the end of §45 that the range of temperatures T, in 
which the fluctuations of the order parameter y become large is extremely 
narrow in superconductors. Outside this range, the fluctuation corrections to the 
thermodynamic quantities are in general very small. They may, however, be 
important as regards the magnetic susceptibility of a metal above the transition 
point: the occurrence, because of fluctuations, of even a relatively small number 
of superconducting electrons may give a contribution to the susceptibility that 
exceeds the ordinarily very small susceptibility of the normal metal far from 
the transition point.‘ 

Let us consider a metal in a weak (9 « H.) external magnetic field at a tem- 
perature above but close to T,. The equilibrium value of the order parameter is 
here y = 0, and to calculate its fluctuations we can use the free energy from the 
Ginzburg-Landau theory. In the expression (45.10), since the fluctuations are 
small, we need retain only the terms quadratic in y, omitting the term in |y!4 
and taking A to be the vector potential of the uniform field H. The fluctuations 
of the induction B due to those of y are quadratic in y (since the current den- 
sity j is quadratic). Hence, in the term B?/87, we can take B to be the mean 
(thermodynamic) value of the induction, and neglect its fluctuations. Thus the 
change in the total free energy of the metal in a fluctuation is given by the follow- 
ing functional of y:* 


1 | 2e |2 
= | I -py £ l +alyl2l dv. i 
AFIy] all iiy- — a)y. +aly] lav (49.1) 
To calculate the fluctuational contribution JF to the free energy, we must 
regard the functional (49.1) as the “effective Hamiltonian”, which determines 
AF from the formula 


exp(—AF/T) = | exp (—AF[y]/T) Dy. (49.2) 


where the (functional) integration is taken over all distributions y(r); see Part 1, 
§147. In practice, it is carried out by expanding y in terms of some complete set 
of eigenfunctions and integrating over the infinite number of coefficients in this 
expansion. For a homogeneous system (without external field), the expansion is 
made simply with respect to plane waves (see, for example, the calculation of the 
fluctuational correction to the specific heat in Part 1,§147, Problem). 


t This effect was pointed out by V. V. Shmidt (1966). 

* To avoid misunderstanding, we should mention that the magnetic field is not, with regard 
to the superconductor, the “external field” A in the sense in which it was defined in Part 1, 
§144. The latter would have to appear in the free energy as a term — h(y +y*), which in the 
present case is certainly impossible because such a term is not invariant with respect to the 
choice of the phase of y. 
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In the present case, the expansion is to be made with respect to the eigen- 
functions of the “Schrddinger’s equation” 
1 2e 


im ae 
Cc 


2 
= A) y = Ey, (49.3) 


corresponding to the Hamiltonian (49.1). It has been mentioned in §47 that this 
equation is formally identical with Schrédinger’s equation for the motion of a 
particle (with mass 2m and charge 2e) in a uniform magnetic field. Its eigen- 
functions are labelled by one discrete (n) and two continuous (p,, p,) quantum 
numbers, the eigenvalues depending only on n and p, (the z-axis is in the direc- 
tion of H) and being given by 


E(n+%, pz) = (n+) lel AO/me + pz/4m; (49.4) 


the number of different eigenfunctions for a given n with p, in the range dp, and 
any possible p, 1s 
[V. 2|e| H/(27h) c] dpz 
(see QM,§112). 
For brevity, we shall denote the set of numbers n, p, p, by one symbol q, and 
write the expansion of the function y(r) as 


P = bi cael), (49.5) 


where c, = c,-+ic,’ are arbitrary complex coefficients and the eigenfunctions are 
assumed normalized by the condition f (Yal dV = 1 (with integration over the 
volume of the metal). 

Substitution of the expansion (49.5) in (49.1) allows, first of all, a change from 
integration over the volume to summation over q: integrating the first term by 
parts, we can bring (49.1) to the form 


| . 26:28 à 


Substituting (49.5) here aud noting that each of the functions y, satisfies equa- 
tion (49.3) with E = E, and that the eigenfunctions with different q are ortho- 
gonal, we find 
AF[y] = ¥ | Cg |*(Eq +a). (49.6) 
Q 


The functional integration in (49.2) denotes integration over all de;dc} - After 
the substitution of (49.6), the integrations over all these variables separate, 
giving 


exp (— AFIT) = [| pz 
Q qd 
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or 


nT 


In terms of the quantum numbers n and p, this expression becomes 


E pean 49.8 
Á ESE yf Ë Enna ma 


This sum diverges for large E, but the divergence is in fact spurious and due 
only to the fact that the original formula (49.1) is applicable only for slowly 
varying functions y(r): the change in y over distances ~£o must be small. In 
terms of the eigenvalues E,, this means that only E, « 4?/mt; are allowable. 
Cutting off the sum over n at some large N which satisfies the condition stated, 
we use Poisson’s formula 


N N 
Sv fath) ~ Sd- AO; 
n=0 0 

see Part 1, (59.10). When applied to (49.8), the integral term here is easily seen to 
give a contribution to the free energy that is independent of $; this term is not 
needed in calculating the magnetic susceptibility, and we shall omit it. In the 
second term, we can now let N > œ, so that the cut-off parameter does not 
appear in the result :t 


dpe eet | dp- 


487?hmc? a+ p2/4m ` 


— 00o 


Finally, calculation of the integral gives 


eT? 
Hence the magnetic susceptibility is 
1 & AF eT, 
CE age maT O 


(H. Schmidt 1968, A. Schmid 1969). We see that the susceptibility increases 
as (T—T,)~* near the transition point. In this range, (49.10) is the principal 
contribution to the magnetic susceptibility of a normal metal. 


t In the coefficient we have put T ~ 7,. For T close to 7., the important values in this in- 
tegral are p, ~ y (ma) ~ h/&(T) < h/£o, i.e. satisfy the requirement stated. 
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PROBLEMS 


PROBLEM 1. Determine the magnetic moment of a thin film (thickness d« &(T)) in a weak 
magnetic field perpendicular to its plane at temperatures T > T, with T- T, « T,. 


SOLUTION. The finite thickness of the film makes the quantum number p, in (49.4) discrete; 
for a thin film, we must take in (49.7) only the value p, = 0; even the first non-zero value 
P: ~ h/d,so that E ~ #?/md* > h?/m&* ~ a. The number of eigenfunctions with given n and 
P, (and any possible p,) is 2|e| 9.S/27fc, where S is the area of the film; hence the summation 
over q in (49.7) is to be taken as (OS/the)>". Applying Poisson’s formula to the sum, we get 


| ETK 
APES 24rmc?a ` 
The magnetic moment of the film is 
E E EE 


lama T-T) 
It should be noted that this increases faster, as T — T,, than for an infinite metal. 


PROBLEM 2. The same as Problem 1, but for a sphere of radius R « (T) (V. V. Shmidt 
1966). 


SOLUTION. In this case, of all the eigenvalues of equation (49.3), only the lowest is impor- 
tant, corresponding to the eigenfunction y = constant, and equal to Ey= e?R*H?/10mc"; see 
the discussion in §47, Problem. The sum (49.7) reduces to a single term, and the magnetic 
moment is 


§ 50. The Josephson effect 


Let us consider two superconductors separated by a thin layer of an insulator. 
For electrons, this layer is a potential barrier, and if it is sufficiently thin there 
is a finite probability that they will penetrate it by quantum tunnelling. Even if 
the transmission coefficient of the barrier is small, its difference from Zero is of 
fundamental importance: the two superconductors become one system de- 
scribed by a single condensate wave function. This leads to effects first predic- 
ted by B. D. Josephson (1962). 

Because there is a single condensate wave function of the system, a super- 
conductivity current can flow through the contact between the two supercon- 
ductors even when no external potential difference is applied. Just as the cur- 
rent density within the superconductors is determined by the gradient of the 
phase ® of the condensate wave function, so the density j of the superconduc- 
tivity current through the contact depends on the difference of the phase values 
®, and®, on the two sides of the contact.‘ Since values of the difference®,—®, 


t In order that the superconductivity current through the contact should have an appreci- 
able value, the thickness of the insulating layer must in fact be very small, ~107? cm. Such 
distances are small even in comparison with the smallest characteristic length parameter of the 
superconductor, the coherence length £. In this sense the layer is to be regarded as of in- 
finitesimal thickness, and the behaviour of the phase within it does not appear in the theory 
atall. 
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which differ by an integral multiple of 2x are physically identical, it is clear 
that the function 
j=j@n), On = Pz- P, (50.1) 


must be periodic, with period 2x. The operation of time reversal changes the 
sign of the current j and also that of the phase M21, since the wave functions are 
replaced by their complex conjugates. This means that the function (50.1) must 
be odd, and is zero when Øz, = 0. Being bounded, of course, j(®21) has maxi- 
mum and minimum values, between which it varies with the phase difference; 
since the function is odd, these values are equal in magnitude, and will be 
denoted by + jm 

The form (50.1) presupposes that the current is not affected by the magnetic 
field of the currents within the contact. In the contrary case, the difference D2; 
would have to be replaced by the gauge-invariant expression 


2 
Da-i ge | Ande 
Åc 
1 


Because the thickness of the insulating layer is very small, the condition for 
the integral of the continuous function 4,(x) to be negligible is easily satisfied, 
and the values of A, itself on either side of the contact may be regarded as 
equal. 

The form of the function j(®21) at all temperatures can be established only 
from the microscopic theory. We shall give here only a phenomenological 
treatment within the range of applicability of the Ginzburg-Landau theory. 

If the contact were entirely impermeable to electrons, the wave functions y 
in each superconductor would satisfy at the boundary of the contact the con- 
ditions (45.15): 

OY, 2ie Ope 2ie 


Bx Fg TO g Fg Aas = 0 


The finite permeability of the barrier and the finite value of y at the boundaries 
of the contact lead to non-zero expressions on the right-hand sides of these 
conditions, depending on the values of y on the other side of the contact. 
Since y is small (near the transition point T,), we need consider only the terms 
linear in ọ in these functions, putting 


(50.2) 


the coefficient 1/2 being proportional to the permeability of the barrier. The 
equations (50.2) must satisfy the requirements of symmetry under time reversal, 
remaining valid under the transformation y > y*, A — —A, whence it 
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follows that the constant / is real; then, under the transformation mentioned, 
equations (50.2) simply become their complex conjugates. 

The relation between the superconductivity current through the contact and 
the difference between the phases of y can be determined by applying formula 
(45.14) to either side of the contact, say side 1: 
ieħ ( OY) a 2e? 





am Pilay Y! ox — rpe Avi. 


Substituting Oy1/0x from the boundary condition (50.2), we find 
ah adi! 
J= Fmi (VIY — PY). 


For contacts of the same metal, yı and we differ only in phase, and we then 
have for the current density 


J =Jm Sin Do, jm == (eh/md)|y |. (50.3) 


As the transition point is approached, |p|? tends to zero asT,—T, and there- 
fore so does the maximum current density through the contact." 

Now let a potential difference be applied from an external source to the 
tunnel contact, so that there is an electric field E in the contact. We shall describe 
this field by a scalar potential, denoted by V:E = — VV. The influence of 
this field on the superconductivity current through the contact can be ascer- 
tained from the requirements of gauge invariance. 

In the absence of the field (V = 0), the phase of the wave function is inde- 
pendent of time: 0®/dr = 0.* To generalize this equation to the case where 
the electric field is present, we note that the general relation must be invariant 
under the gauge transformation 


1 ôx) 
V ar ar (50.4) 





of the scalar potential, which does not affect the vector potential (assumed in- 
dependent of time). Just as in the derivation of the transformation (44.3), 
(44.6), we find that together with V, the phase of the wave function must be 


transformed, by 
D — G+ (2e/fic) y(t). (50.5) 


t The microscopic theory based on the BCS model shows that a relation of the type (50.3) 
between j and ®,, is valid at all temperatures. The same theory gives a relation between jm 
and the electrical resistance of the contact between two metals in the normal state. This 
theory is described by I. O. Kulik and I. K. Yanson, The Josephson Effect in Superconductive 
Tunneling Structures, Israel Program for Scientific Translations, Jerusalem 1972. 

tł The time factor exp (—2iut/t) is eliminated from the wave function because the Hamil- 


tonian Ĥ of the system is replaced by H’ = H- uN ; cf. the sixth footnote to §31. 
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Hence it is clear that the relation 


OP ry _o (50.6) 


is gauge-invariant; it becomes 0®@/0t = 0 when V = 0. 
When the electric field is independent of time, integration of (50.6) gives 


D = DO —(2e/h) Vt, 


where ® is independent of time. Hence, if a constant electric potential differ- 
ence Va is applied to the contact, the phase difference there is 


Dy, = DY —(2e/h) Vat. 


Substituting this expression in (50.3), we find the superconductivity current 
through the contact: 
j = jm Sin (DO —(2e/h) Voit). (50.7) 


We thus arrive at a noteworthy result: the application of a constant potential 
difference to the tunnel contact causes the appearance of a superconductivity 
alternating current with frequency 


wj = 2 eVa A. (50.8) 


The power consumed at the contact is JV 21; its (time) average value is zero, 
i.e. there is no systematic expenditure of energy by the external source; this 
is as it should be for a superconductivity current, which does not involve any 
dissipation of energy. We must emphasize, however, that when there is an ex- 
ternal e.m.f. there will also be a normal current (weak when V», is small) 
through the contact, and this is accompanied by dissipation. 

The conclusion that the superconductivity current through the contact varies 
periodically with the frequency (50.8) follows, in fact, from the periodic 
dependence of j on ®2; and the linear time dependence of ®2i; it does not 
rest on any assumptions about the magnitude of the potential difference. 
The specific formula (50.7) is valid only if the frequency œ; is small in compari- 
son with the frequency 4/f that characterizes the superconductivity : 


hw; =2\eV| « A(T). (50.9) 


PROBLEM 


Write down the equation for the current in a circuit consisting of a resistor R connected in 
series with a superconductor having a tunnel contact, with ane.m.f. Vp acting in the circuit. 


SOLUTION. The total! voltage drop in the circuit is Vj = RJ+V.,, where J is the current 
in the circuit and V2, the potential difference across the contact.t Substituting J = Jm sin ®.1, 


t We neglect the normal current in the superconductor, which is small if Vo is small. 
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and F}, from (50.6), we find 


h OD, 
2'e] ôt 


The variable current described by this equation is not sinusoidal. 


= Vo— RI m sin ®,,. 








§ 51. Relation between current and magnetic field in a superconductor 


Formulae have been derived in §44 which give the relation between the current 
and the magnetic field in a superconductor in the limiting (London) case 
where all quantities vary slowly through the body; the field was assumed to 
be much less than the critical value. Let us now consider this problem in the 
general case where the static field, though still assumed weak, varies in any 
manner in space. The words “varies in any manner” here mean that the field 
may vary considerably over distances ~ £o (but, of course, will still vary only 
slightly over distances of the order of the lattice constant; the inhomogeneity 
of the metal over atomic distances is therefore unimportant). 

In the general case, the relation between the current and the magnetic field 
in a Spatially infinite medium is given by an integral formula of the type 


Jit) = — f Onde r’) Alt’) Bx’, (51.1) 


where the kernel Qy depends only on the properties of the medium itself. 
The linearity of (51.1) corresponds to the assumption that the field is weak. 

The current density may be regarded as the variational derivative of the 
energy of the system with respect to the vector potential: the change in the 
Hamiltonian of the system when A is varied is 


ôH = — (i/c) f i.6A dx; 


see OM, (115.1). Hence the kernel Q,, in (51.1) is the second variational deriv- 
ative, and the symmetry as regards the order of the twofold differentiation 
(with respect to 4,(r) and A,(r’)) has the result that 


Qilt = r’) = Ox:(r' = r). (51.2) 


Expanding A(r) and j(r) in Fourier integrals, we can write the relation (51.1) 
for the Fourier components as 


jik) = — Oix(k) 4k), (51.3) 
where, from (51.2), Q (k) = O,(—k). 


t The problem of an infinite medium here has only formal significance. Its actual impor- 
tance lies in the subsequent application of the results to the problem of a finite medium (§52). 
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Some important properties of the function Q,,(k) follow from the require- 
ments of gauge invariance. The current j must be unaltered by the gauge trans- 
formation A(r) — A(r)+7x(r) or, in Fourier components, 


A(k) > A(k) +7ky(k). 
This means that the tensor Q,,(k) must be orthogonal to the wave vector: 
Oix(k) ky = 0. (51.4) 


In particular, in a crystal with cubic symmetry, the tensor dependence of Q; 
reduces to terms of the forms 6, and k;,k,; it then follows from (51.4) that 


Qik = (èx — “Er O(k), (51.5) 


where O(k) is a scalar function. 

We now choose a potential gauge such that div A(r) = 0. This implies 
that for the Fourier components k.A(k) = 0. Hence the relation (51.3) be- 
tween the current and the potential reduces to 


i(k) = —Q(k) A(k), (51.6) 


i.e. is determined only by the scalar function Q(k). 
The London case corresponds to the limit of O(k) as k — 0. This is easily 
found by taking the curl of both sides of equation (44.8), 


curl j = —(e’n,/mc) curl A, 


and using the fact that div A = 0. Since the equation of continuity gives div j = 
= 0, we find 
Aj = —(e2ns/mc) AA. 


In infinite space with the functions j(r) and A(r) everywhere finite, it then fol- 


lows that 
į) = —(e’n,/mc) AQ), (51.7) 


i.e. the value of the current at every point is determined only by the value of 
the potential at that point. A similar equation is valid between the Fourier 
components j(k) and A(k), and comparison with (51.6) shows that Q(k) is 
independent of k:* 


O(k) = e2n;/mc (= c/4ad? as k — 0). (51.8) 


The rest of this section will deal with the calculation of Q(k) for the BCS 
model, which supposes, as already mentioned, an isotropic degenerate Fermi 


t In this and the following sections, the London penetration depth is denoted by 6,. 
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gas with weak attraction between the particles (electrons). It is also assumed 
that these particles interact with the magnetic field through their charge e. 

In §42 we have given the equations (42.5) for the temperature Green’s 
functions of a Fermi gas in the absence of an external field. The introduction 
of the magnetic field is achieved by replacing the operator y by V—ieA/c in 
the Hamiltonian A (7.7).' A similar change therefore occurs in equation 
(7.8) for Ë and correspondingly the change 7 > V-+ieA/c in the similar equa- 
tion for Ê+; it is evident that the same applies to the equations for Y™ and D, 
The spin term (~ o.H), corresponding to the direct interaction of the magnetic 
moment of the electron with the field, is small and may be neglected in the 
Hamiltonian and in the equations. When the operator v acts on the functions 
G(t,r; t’,r’) and GF(z,r; Tt, r’), the operators M(t, r) and H(z, r) respectively 
are differentiated. Hence, in equations (42.5), the magnetic field is introduced 
by the same substitutions V + Y FieA/c. 

The presence of the external field makes the system no longer homogeneous 
in space, and the dependence of the Green’s functions on the arguments r 
and r’ is no longer simply a dependence on r —r’; but the functions still depend 
on t and 7’ only through the difference t— tv’. We shall write down immediately 
the equations for the Fourier components with respect to t—17’: 


i 1 i 2 pereen 
fita [2A] +4) aenn Gon) = 00), 
a j (51.9) 
. l ] 2 Var: m ig ` 
G istr [vao] tulFesr r’)—g5"*G(Cs34, r’) = 0. 
For a weak field, the only case we shall consider here, these equations can 
be linearized; we put 


G= G1G®, F=FOLFO, (51.10) 


where the first terms are the values of the functions in the absence of the field, 
and the second terms are small corrections linear in the field, and we retain 
in the equations only the terms of the first order of smallness in A. 

Here it must be borne in mind that the presence of the field also changes the 
condensate wave function 2, which in this case does not reduce to a constant. 
This complication, however, does not occur with our choice of the vector po- 
tential gauge, in which 

div A = 0. (51.11) 


This is because the first-order correction (to the constant value 5) in the 
scalar function 5(r) could only be proportional to div A, and is zero with the 
condition (51.11). Hence, with the necessary accuracy, we can put gZ = 


+ We put A = 1 in the rest of this section (in equations (51.9)—(51.19)). 
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= g=© = J in the linearized equations, where 4, the gap in the energy spec- 
trum of the gas in the absence of the field, is a real quantity. 
The linearized equations (51.9) then become 


(z. ++ n) Geir r) +A FOr r) 

= (ie/mc) A). GOC; r—r'), (51.12 
(ta te) FOr) IGMCi ne) 

= —(ie/mc) Ar). FOC; r—r’). 


Since these equations are linear in A, it is sufficient to solve them for one 
Fourier component of the field, i.e. 


A(r) = A(k)e", k.A(k) = 0. (51.13) 
With this A(r), the dependence of G@ and ( onr+r’ can be separated imme- 
diately by putting 
M(r.° = e pp’) pik.(rtr’)/2 
Css, r’) = gC r-r’) e : (51.14) 
FHC r, r) = fC; r-r) ek etry, 


For example, the first equation (51.12) then becomes 


1 
t+ mly +34) +a gÈ; r—r')+ 4f (Cs; r—r’) 
= (ie/mc) A(k).e =0-0027 GO(C,; r—r’), 
and similarly for the second equation. We now make a Fourier transformation 
of the functions g and f with respect to r—r’. We finally arrive at the following 

pair of algebraic — 


[t-z (P+ zk) +4] eGo w+ Ara wD | 
| 


= — (emo) AW B® (to Pk) 
(51.15) 


| ~~ (pz) + u| SCP- Aap) 


= (e/me) p.A(k) FO (¢. p— zk) 


After some simple calculations using (42.7) and (42.8) for @® and FF, 
the solution of these equations is found to be 


ony E 
g(s P) = ~ pathy E ee (51.16) 
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where e, = e(p+3k), nı = n(p+ 4k); the function f(C,, p) will not be needed 
below. 

Let us now calculate the current. To do so, we start from the expression for 
the current density operator in the second-quantization representation :* 


j= EVEN P-DE- avy, 


To change to the Matsubara representation of this operator, the Heisenberg 
operators ¥, Ù+ are to be replaced by the Matsubara operators Y™”, ®™, Using 
the definition of the Green’s function (37.2), we find that the current density 
(the diagonal matrix element of the operator j, averaged over the Gibbs distribu- 
tion) may be written 


j@) = 2 ae (7V) Bt, r; T, rear, =r +0—(€?/mc) A(r)n, (51.17) 


where n is the particle number density; the factor 2 comes from G,, = 2G. 

When we substitute G = G+ G™ in (51.17), the term in Y disappears: 
for a homogeneous isotropic system, GZ (r—r’) is even, and its derivative is 
zero for r—r’ = 0. Taking a Fourier expansion with respect to t—t’, we ob- 
tain 


i) = “7 5 [(v’ 9) GE, r, rlr- AO, 


sas em 


and on substituting oe and G” from E and (51.14), 


i= FS |D EAN. 


$= —00 


When g(¢,, p) is substituted here from (51.16), it is convenient to use at the 
same time the fact that the vectors j(k) and A(k) are transverse and to average 
over directions of p, in a plane perpendicular to k, using the formula 


PriPsk = 3 y p° sin? O(0i,.—kiky/k?), 


where @ is the angle between k and p. We thus obtain the following expression 
for the function Q(k) which determines the relation between j(k) and A(k): 


en a ae . o n (stn) (s+ _)+4 Bp ne 
O(k) = © | psin? 6 “rea ETA Gay + me : 


s= — o9 


l a i ane (51.18) 
ee = m+ n= za (PET) = 


t See QM, §115. The term giving the contribution to the current from the particle spin is 
here omitted. For a non-ferromagnetic system, in which the Green’s function Gag = G ôzp, 
this term gives zero on averaging. 
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The integrals and the sum as written here are formally divergent. Although 
these divergences are actually spurious, the calculation must be made with 
caution: until the divergence is removed, the result may depend on the order 
of integration and summation. 

This difficulty can be avoided by making use of the obvious fact that Q = 0 
when A =0: in a normal metal there is no superconductivity current. We there- 
fore do not alter the result by subtracting from (51.18) the same expression 
with 4 = 0: 


oy = FF [esmo [Ei Een 


se +8) (G+) 
1 dp 
=n) n) | oa ee 


This expression is satisfactorily convergent, and the integration and summation 
can be performed in any order. 

First of all, let us note that the relevant values of k are small in the sense that 
k «pp; this inequality simply expresses the fact that the characteristic dis- 
tances over which the field and current vary in a superconductor are large in 
comparison with the distances between the particles, i.e. with ~ 1/pr. 

In (51.19), we first integrate with respect to p. The integral comes mainly from 
a narrow range of momenta near the Fermi surface, | p—p,;| ~k. In this range, 


Ne ~ Ny vrk cos 0 x up (p—pr)+ 4 vrk cos 0; 


the factor p? in the integrand may be replaced by pẹ, and integration over dp 
by one over 2xmp; dn d cos 0. The integral over dn of the second term in the 
braces in (51.19) is then zero: the contour of integration can be closed by an 
infinite semicircle in the complex 7-plane, and the integral vanishes because 
both poles of the integrand are in the same half-plane (upper or lower, depend- 
ing on the sign of ¢,). The integration over dy in the first term in (51.19) is 
elementary, and this leaves only the integral with respect to x = cos 0. With 
the density n from the equation pẹ} = 372n, we have as the final result (in ordi- 
nary units) 





1 
Ome =f M-d O 
O(k) = 4mc 5, f [02+ A24(4 horkx)’] (C2 + A212 (51.20) 
-1 


Cs = (2s+1)aT 


(J. Bardeen, L. N. Cooper and J. R. Schrieffer 1957).* 

In the limit of small k (ko < 1, where £o ~ fivp/do ~ hv;/T, is the coherence 
length), it can be shown that (51.20) reduces to the London expression (51.8), 
which is independent of k, but we shall not pause to prove this. 


t The derivation given here, using the temperature Green’s functions, is due to A. A. 
Abrikosov and L. P. Gor’kov (1958). 
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In the opposite limit, when ko œ 1, the important range in the integral 
(51.20) is x < T,/fkv,; <« 1. We can therefore neglect x? in comparison with 
unity in the numerator of the integrand, and then (because of the rapid con- 
vergence) extend the integration from — œ to œ. The result is 


372ne’T 2 A? 
Qh) = schok (> CLAP 


Carrying out the summation by means of (42.10), we can write this ast 


O(k) = cf/4ak, 


4ane? 3n? A (51.21) 
= / £ 
mE Uio; A tanh FT’ ko => 1. 








When T « T, we have n, ~ n, A ~ do, and B ~ 1/67. When T,—T « T., 
the gap -1 is small, so that tanh (4/27) ~ 4/2T,; using formulae (40.4) and 
(40.5), we again find £ ~ 1/6;&. Thus, at all temperatures from 0 to T, 


B ~ 1/820. (51.22) 


The function O(k) therefore remains approximately constant in the range 
k <s 1/0 (and has a regular expansion in powers of k? near k = 0); outside 
this range, O(k) decreases, as 1/k when k > 1/&. This behaviour of Q(k) 
corresponds to a coordinate function Q(r) that decreases slowly (as 1/r?) in 
the range r S & and rapidly (exponentially) outside that range. Thus the 
correlation between the field and the current always extends to distances ~ So. 
It should be emphasized that this statement is valid at all temperatures from 
zero to T.. We have thus justified the assertion in §44 that £o is universal as a 
characteristic length parameter for superconductivity. 


§ 52. Depth of penetration of a magnetic field into a superconductor 


Let us apply the results of §51 to the problem of the penetration of an external 
magnetic field into a superconductor, which has been analysed in the London 
approximation in §44, 

Let the superconductor have a plane boundary surface and be in the half- 
space x > 0; let the external field H (and therefore the induction B within the 
superconductor) be along the z-axis parallel to the surface. Then all quantities 
depend only on the coordinate x, and the current j and the vector potential A 
(in the gauge with div A = 0) are along the y-axis. Maxwell’s equation curl B = 
= —AA = 4zj/c reduces to 


A" (x) = — 4xj(x)/c, x > 9, (52.1) 
where the prime denotes differentiation with respect to x. 


t A formula of this kind was suggested by A. B. Pippard (1953) from qualitative arguments, 
before the microscopic theory of superconductivity existed. 
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The boundary conditions on this equation depend, however, on the physical 
properties of the metal surface as regards electrons incident on it. The simplest 
case is that of mirror reflection of electrons from the surface. It is evident that, 
with this law of reflection, the problem of a half-space is equivalent to that of an 
infinite medium in which the field A(x) is distributed symmetrically on either 
side of the plane x = 0: A(x) = A(— x). The derivative A’(x), an odd function 
of x, is discontinuous at x = 0, changing sign as x passes through zero. Thus 
the condition B = A’ = § on the surface of the half-space corresponds in the 
infinite-medium problem to the condition 


A'(+0)— A’(—0) = 26. (52.2) 


We multiply (52.1) by e~** and integrate with respect to x from — œ to œ. 
On the left-hand side, we write 


f Ave dx = | (A eny dx+ f (A'eiy dx+ik f A'e`ikx dx, 
So eae ò Š% 


The first two integrals together give — 2, and in the third integral we can sim- 
ply integrate by parts, since A(x) itself is continuous at x = 0. The result is 


29+k2A(k) = Anj(k)/c, 


where A(k) and j(k) are the Fourier components of the functions A(x) and j(x) 
defined through all space. They are related by j(k) = —Q(k) A(k), where O(k) 
is given by the formulae derived in §51. Thus we have for the Fourier compo- 
nents of the field 


_ aw 
A) =~ eR Oe: (52.3) 


The penetration depth ô is defined ast 


ô= 5 f B(x) dx = -2 (52.4) 


— 00 


Expressing A(x = 0) in terms of the Fourier components A(k) and substituting 
the latter from (52.3), we have 


1. dk 1 dk 
jae | 40 = | azote ; (52.5) 
The important range of values of k in this integral is that for which k2~420/c. 
In the London case (when 6, >> £o), these values are small in the sense that 
k&o < 1. Here Q(k) is given by the expression (51.8), which is independent of k, 
and the integration in (52.5) leads, of course, to the value 6 = 6,. 


t When the field decays exponentially, this definition agrees with (44.13). 
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In the opposite (Pippard) case, for which 6, < &, the important values in the 
integral are k >> 1/E. Here Q(k) is given by (51.21), and the integral (52.5) 
gives 

ô = dp = 4/33/2813, (52.6) 


Using (51.22), we thus find for the Pippard penetration depth 
dp ~ (5380). (52.7) 


These calculations have related to the case of mirror reflection of electrons 
from the surface of the metal. In the London case, however, the penetration 
depth is independent of the law of reflection, as is clear from the derivation of 
the value of ô, in §44; when 6 >> £o, the details of the surface structure are 
not significant. 

In the Pippard case also, however, the dependence of the penetration depth 
on the law of reflection is in fact very slight. For example, in the opposite case 
to mirror reflection, namely diffuse reflection, when the velocities of the 
reflected electrons have directions distributed isotropically for any direction 
of incidence, the value of 6, is only a factor of 9/8 larger than for mirror 
reflection. 


§ 53. Superconducting alloys 


The presence of impurities has a much more profound effect on the proper- 
ties of superconductors than on those of normal metals. The corrections to the 
thermodynamic quantities for a normal metal remain small so long as the 
impurity atom concentration x is small, and become considerable only when 
x ~ 1, i.e. when the mean distance between impurity atoms becomes compar- 
able with the lattice constant a. We should emphasize that we are speaking 
here, of course, only of the electronic contributions to the thermodynamic 
quantities, and only of those that are determined by the mean quantum state 
distribution density of conduction electrons in momentum space (for example, 
the specific heat and the magnetic susceptibility in weak fields). 

The situation in superconducting metals is different, because of the existence 
of a characteristic length parameter, which is large compared with a, namely 
the coherence length Ło. Since the scattering of electrons by impurity atoms 
destroys the correlation between electrons, the properties of the superconductor 
may alter considerably when the electron mean free path / is comparable with 
Eo, the concentration x being still small. We shall here describe qualitatively 
the basic results needed for a general understanding of the properties of such 
low-concentration alloys.t 


t The full theory of superconducting alloys by A. A. Abrikosov and L. P. Gor’kov is quite 
complicated and lies outside the scope of this book. See the original papers in Soviet Physics 
JETP 8, 1090, 1959; 9, 220, 1959. 
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Let the impurity atoms have no angular momentum, and therefore no mag- 
netic moment (non-paramagnetic impurities). In such a case, they only slightly 
affect the thermodynamic properties of the superconductor in the absence of a 
magnetic field. The reason is that such impurities do not destroy the symmetry 
under time reversal: the interaction between impurity atoms (distributed in 
some manner) and electrons can be described by specifying a potential field 
U(r). According to Kramers’ theorem, the electron energy levels in such a field 
remain doubly degenerate, and the states corresponding to these levels are 
changed into each other by time reversal, so that electrons in them can form 
Cooper pairs. This will happen, as before, near the sharply defined Fermi 
surface, with the difference that this surface itself now forms the boundary of 
the occupied states not in momentum space but in the space of quantum 
numbers in the field U(r); at low impurity concentrations, the density of quan- 
tum states near the Fermi surface changes only slightly. 

It is therefore clear that, after averaging over the positions of the impurity 
atoms, we must obtain formulae that differ from those of the theory of pure 
superconductors only by corrections that are of the same order of smallness as x. 
When these unimportant corrections are neglected there is, in particular, no 
change in the temperature T, of the transition point and the discontinuity of 
the specific heat at that point (P. W. Anderson 1959). There is therefore also no 
change in the ratio «2/b of the coefficients in the Ginzburg-Landau equation 
(see (45.8)); the form of this equation is independent of the presence or absence 
of impurities, and it is equally valid for pure superconductors and for super- 
conducting alloys. 

On the other hand, the magnetic properties of a superconductor, and in 
particular the magnetic field penetration depth, change considerably even when 
i ~ &, Let us estimate the penetration depth, assuming that the mean free 
path / < & although the concentration x « 1 (A. B. Pippard 1953). 

The collisions of electrons with impurity atoms annihilate the correlation in 
the motion of electrons at distances r 2 /. This means that the kernel Q(r) 
in the integral relation between the current and the field in the superconductor 
will decay exponentially at distances r ~ l « o. Accordingly, in the momen- 
tum representation, the function Q(k) will remain constant in the range k < 1/l. 
The value of this constant may be found by “joining” at kl ~ 1 to formula 
(51.21), which remains valid for k >> 1/1] = 1/&. Thus we find that 


ne IA A 
Q(k) ~ We TE tanh = for kl <1. (53.1) 


The penetration depth ô is determined by the relation k? ~ Q(k)/c when 
k ~ 1/6 (see §52). Using (53.1), we find 


ô ~ op (T = 0) (2 _]"” 
= I tanh (4/2T) 


~ ôE" (T) (Eo/1)¥?; (53.2) 
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for this formula to be valid, we must have 6 >> /, which justifies the use of (53.1). 
The superscript “pure” denotes the value in the absence of impurities, and £o is 
also assumed to have its value for a pure superconductor. The expression 
(53.2) can also be represented by a London formula, taking the number density 
of superconducting electrons in it as 


n, ~ nbure Eq/. (53.3) 
In terms of the coefficients « and b in the Ginzburg-Landau equation, the rela- 
tion (53.2) implies (see (45.16)) that 
bja ~ (b/«)pure Eo/I. 


Since also, as mentioned above, «?/b is independent of the presence of impurities, 
we find that 


4 ~ &pure &o/l, b~ bpure (&o/ DP. (53.4) 
According to (45.17), we hence have for the correlation radius 
E(T) ~ &(T)pure (1/E0)!? (53.5) 


and for the parameter x (45.18) 


x ~ Xpure Ẹo/l >> Xpure- (53.6) 


When the mean free path is sufficiently short, x exceeds 1/+/2, and so sufficient- 
ly “dirty” superconductors are of the second kind. 

The range of applicability of the Ginzburg-Landau equations to “dirty” 
superconductors is limited at low temperatures, in practice, only by the con- 
dition T,—T « T, The necessary inequality 6(7) >> / is equivalent in this case 
to the weaker condition 


TTT x Bue(Eoll® ~ Eol. 





Finally, some comments on the properties of superconductors containing 
paramagnetic impurities. Such impurities destroy the symmetry of the system 
under time reversal, and therefore the pairing of electrons (when magnetic 
moments are present, time reversal implies a change in their sign, i.e. essen- 
tially replaces one physical system by another). A quantitative measure of the 
influence of these impurities on the properties of the superconductor is the 
mean free path /, for scattering with change of spin direction (due to the 
exchange interaction with impurity atoms). The superconductivity disappears 
when the concentration x reaches the critical value at which /, ~ čo. 

Actually, however, there are two critical concentrations, of the same order 
of magnitude. At the lower (x1), the gap A in the energy spectrum becomes Zero; 
the condensate wave function Æ, however, becomes zero only at some con- 
centration xz > xı. At concentrations between xı and xə, there is zero-gap 
superconductivity. Since, in deriving the London equation in §44, we used only 
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the existence of the condensate function and considerations of gauge invar- 
iance, it is clear that the basic properties of the superconductor (the supercon- 
ductivity current, and the Meissner effect) will continue to exist in this range. 
The absence of a gap in the spectrum is shown (in the equilibrium properties of 
the superconductor) by a non-exponential temperature dependence of the spe- 
cific heat. There is no conflict with the Landau condition for superfluidity 
(§23), because in disordered systems (such as the alloys in question) this condi- 
tion does not apply, the elementary excitations having no definite momentum.* 


§ 54. The Cooper effect for non-zero orbital angular momenta of the pair 


It has been mentioned several times that the occurrence of superfluidity in a 
Fermi system is due ultimately to the Cooper effect, the formation of bound 
states (pairing) by mutually attracting particles at the Fermi surface. For a 
Fermi gas, the condition of attraction is formulated as that of a negative 
scattering length a = f U dx, i.e. a positive scattering amplitude for two 
particles in a state with zero orbital angular momentum of their relative motion, 
{ = 0 (this state makes the main contribution to the scattering at low energies). 

A much stronger statement is valid, however: pairing (and therefore super- 
fluidity) occurs if the interaction is attractive even for just one value of the 
angular momentum / (L. D. Landau 1959). We must emphasize that the 
system in question is an isotropic one (liquid or gas), in which the states can be 
classified according to the value of /. 

The statement will be proved fora Fermi gas bya method that in principle 
allows us to determine the temperature T, at which the transition to the super- 
fluid state occurs, using only the properties of the system (normal Fermi gas) 
at temperatures T > T, 

It has been noted in §18 that, in the mathematical formalism of Green’s 
functions for a normal Fermi system, the energy of the bound state of the pair 
of particles appears as the pole of the vertex function I’; the same is true (when 
T = 0) of the temperature vertex function, which we denote by t. When such 
a pole appears, the whole formalism in fact becomes inapplicable, but it is still 
applicable at the instant when, with decreasing temperature at T = T,, a pole 
appears; the binding energy of the pair must then be zero, and the states of the 
superfluid and normal phases coincide. 

In the skeleton diagram 





t See also the fourth footnote to §39. The theory of zero-gap superconductivity is 
described in the original paper by A. A. Abrikosov and L. P. Gor’kov, Soviet Physics JETP 
12, 1243, 1961. 
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the circle denotes —“©. The transition point T, is determined, from the above, 
as the temperature at which © has a pole with 


Csi = Coo = 0, pitpe = 0. (54.1) 


The first equation states that the particles paired are on the Fermi surface, 
and that the binding energy of the pair is zero; the second equation states that 
the particles paired have opposite momenta. 

The pairing of particles occurs if they have any attraction, however slight. 
It is clear that, if a pole is to occur, the perturbation-theory series for the vertex 
function must include terms containing integrals that diverge with the condition 
(54.1) and as T, + 0 (T, is small when the attraction is weak); otherwise, 
all corrections to the (finite) first-approximation term would certainly be small 
in comparison with that term at all temperatures, and no pole could occur. 

This requirement is satisfied by the series of ladder diagrams 


3 1 
3am l 3 e—a e| Ja al MMM | 
sea eg a, G48) 
2 G atemat 2 Gana qaal ala? 
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As will be seen below, in all these diagrams (from the second onwards) the 
smallness with regard to the interaction (from the adding of the broken lines) 
is compensated, in the sense mentioned, by the divergence of the integrals.t 

Applying to this series the procedure used to obtain (17.4) from (17.3), we 
find that the equation (54.2) is equivalent to the diagram equation 


B R 
TIRE = 17A (54.3) 
- - -p amd 

Z 5y Fs R R R 

-3 f} 


The external and internal lines in the diagrams correspond to the arguments 
which are shown in (54.3) with (54.1) taken into account: 


Py me (0, Pi), P; = (0, p3), Q oF (ibs, q). 


The spin dependence of the Green’s functions of an ideal gas is separated in 
the form GO = 6G and the spin dependence of the vertex function (with- 
out antisymmetrization) in the form 


Dyo, op(Ps, Ps; Pi, P2) = 52,5p0 O(Ps, Pa; Pi, Po). 


t The diagrams (54.2) should be supplemented by a further series of diagrams with the 
external lines 3 and 4 interchanged, making the vertex function antisymmetrical with respect 
to its spin and orbital arguments. However, for the present purpose of determining T, this 
is not necessary, since the pole appears simultaneously in both these parts of the vertex func- 
tion. 
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Expanding the diagrams (54.3) by the rules given in §38 and cancelling the spin 
factors, we obtain for © the integral equation 


“O(p3, — Ps; Pi, — P1) 
+T ¥ | U@s—q) BC, a) G-t, -@) x 


S= ~ o0 


d 
x Oq, —4; P —P1) ne = U(pi—ps). (54.4) 


In the sum and integrals here, small values of the discrete variable ¢, are 
important, and values of q near the Fermi surface (see below). Hence we can 
put ¢, = Oand q = pr in the factors U and © in the integrand. The vectors pı 
and pz are also on the Fermi surface. Thus the functions © and U in (54.4) 
will each depend on only one independent variable, the angle between any two 
of the three vectors Pı, ps and q on the Fermi surface. 

Equation (54.4) can now be solved by expanding U and © in series of Le- 
gendre polynomials: 


U(8) = Ș (2l+1) a,P(cos 8), 

= . (54.5) 
OP) = ¥ (21+ 1) G,Pi(cos 9), 

[=0 


where ® is any of the angles mentioned. Substituting these expansions in (54.4) 
and integrating by the addition theorem for spherical harmonics, we find 


O,1+alI) = a, (54.6) 
where 


T = dq 
(0) 12 a fect Sate 
iI = T pi a 0, (Čs, q); ai ~ (Qn pace E+N? 9 


sS=— oo 


(54.7) 


the function @ is taken from (37.13), and n, = g?/2m—u ~ U-(4—Pr)- 
According to the summation formula (42.10), 


3 
E E E LA = (54.8) 


Fhe divergence of the integral over dg = dy/v, at the upper limit is Spurious 
(cf. the last footnote to §39), and the integral must be cut off at some 7 ~ é,." 
As T - 0, however, the integral diverges logarithmically at the lower limit also, 
i.e. behaves as log (1/T). 


t Because of the rapid convergence of the sum over s in (54.7), only small values of ¢, are 
actually important, and the logarithmic form of the integral over dq justifies the assumption 
that g is close to pp. 
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It is seen from (54.6) that ©, becomes infinite (i.e.“© has a pole) under the 
condition 
}+alT = 0. (54.9) 


This equation has the same form as that which determines the transition point 
for pairing with / = 0, differing from it only in that the “coupling constant” 
g is replaced by —a, (cf. (42.11)); taking this as an equation to determine T,, 
we must put 4 = 0, and e(p) then coincides with 7,. We see, therefore, that the 
vertex function has a pole if any of the quantities a, is negative; the transition 


temperature is then 
T® = (yé,/2) exp (—2/| ay! vF); (54.10) 


cf. (40.4) and (39.19). If a, < O for several different values of /, the transition 
occurs at a temperature T corresponding to the largest | a,'." 

It can be shown that for any Fermi gas (or liquid) consisting of electrically 
neutral atoms, the quantities a, must always become negative for sufficiently 
large / (L. P. Pitaevskii 1959). The reason is that, in the interaction of neutral 
atoms, there is always a range of (large) distances at which it is an attraction 
(the van der Waals attraction). 

In an actual liquid of this kind, the liquid isotope He’, the occurrence of 
superfluidity seems to be due to pairing with / = 1.* We shall not consider 
here the structure of the superfluid phase, and shall discuss only briefly the 
choice of the order parameter which distinguishes this from the normal phase. 
The anomalous Green’s function Fst, r1; t, rə) = F,,(f¥1—¥2) is a quantity 
that is zero above the transition point but not below it; as already mentioned 
in §41, it acts as the wave function for bound pairs of particles. Its Fourier 
component F,,(p) on the Fermi surface (i.e. for p = 2p,n) is a function of the 
direction n, and not a constant as in pairing with / = 0. Since the y operators 
anticommute, the function F,,(n) is antisymmetrical, as it should be, with 
respect to the interchange of particles: F,,(m) = — F,,(—n). 

In pairing with / = 1, as with any odd angular momentum, F,, is an odd 
function of n, and so it is a symmetrical spinor. This means that the spin of the 
pair is 1, as it should be for a state of two equal fermions with odd /. A sym- 
metrical spinor of rank two is equivalent to a vector, which we denote by d. 
In the case / = 1, the dependence of d on n must correspond to the Legendre 
polynomial P\(cos 6), i.e. must be linear, d; = Yin, The complex tensor y; 
of rank two (not necessarily symmetrical) describes the superfluid phase. There 
are in fact two different superfluid phases of liquid He3, with different forms 
of the tensor Yy. 


Ì If all a; > O there is no transition, and formula (54.6) for ©, is valid at all temperatures 
down to zero. Then all “©, tend to zero as 1/\log T' when T— 0. This is an instance of the fact, 
already mentioned in the last footnote to §6, that when T = 0 the function “© (and therefore 
the quasi-particle interaction function/) is zero for particles with opposite momenta. 

ł The transition occurs at a temperature ~ 1073°K. The smallness of T, ensures the existence 
of a range in which the theory of the normal Ferm: liquid is applicable to liquid Heè. 
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ELECTRONS IN THE CRYSTAL LATTICE 


§ 55. An electron in a periodic field 


THE electron clouds of the atoms in a crystal interact strongly with one another, 
and in consequence we cannot speak of the energy levels of individual atoms, 
but only of levels for the electron clouds of all the atoms in the body as a whole. 
The nature of the electron energy spectrum is different for various types of 
solids. As the first step in the study of these spectra, however, we must consid- 
er the more formal problem of the behaviour of a single electron in a spatially 
periodic external electric field, which serves as a model of a crystal lattice. 
This subject is dealt with in §§55-60. 

The periodicity of the field signifies that it is unaltered by a translation of the 
form a = 1ya;+MN2a2+N3a3 (where aj, 22, a3 are the basic lattice vectors and 
11, Hg, ng are any integers): 

U(r+a) = U(r). (55.1) 


Hence the Schrédinger’s equation describing the motion of the electron in such 
a field is invariant under any transformation r > r+a. Thus, if y(r) is the wave 
function of a stationary state, then y(r+a) is also a solution of Schrédinger’s 
equation describing the same state of the electron. This means that the two 
functions must be the same apart from a constant factor: y(r+a) = constant 
x (r). It is evident that the constant must have unit modulus; otherwise, 
the wave function would tend to infinity when the displacement through a 
(or —a) was repeated indefinitely. The general form of a function having this 
property is 

Wa(r) = e*uy(r), (55.2) 


where k is an arbitrary (real) constant vector and u,, is a periodic function: 
u(r +a) = usr). (55.3) 


This result was first derived by F. Bloch (1929); wave functions of the form 
(55.2) are called Bloch functions, and in this connection an electron in a periodic 
field is often called a Bloch electron. 

For a given value of k, Schrédinger’s equation has in general an infinity of 
different solutions corresponding to an infinity of different discrete values of 
the electron energy e(k); these solutions are labelled by the suffix s in y 
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A similar suffix, the number of the energy band, must be assigned to the various 
branches of the function e= «,(k), the electron dispersion relation in the period- 
ic field. In each band, the energy takes values in a certain finite range. 

For different bands, these ranges are separated by “energy gaps” or partly 
overlap; in the latter case, in the overlap region, each value of the energy cor- 
responds to values of k that are different for the various bands. Geometrically, 
this means that the constant-energy surfaces corresponding to two overlapping 
bands s and s’ are in different regions of k-space. Formally, the overlapping 
of the bands signifies degeneracy : different states have the same energy, but this 
does not cause any peculiarity in the spectrum, because they correspond to 
different values of k. The general case of overlapping is to be distinguished from 
the intersection of bands, when the values of «,(k) and ¢,(k) coincide at par- 
ticular points k (the constant-energy surfaces intersect). Degeneracy is usually 
understood to refer only to such a case; intersection leads to the occurrence 
of certain peculiarities in the spectrum. 

All the functions yy, with different s or k are, of course, orthogonal. In 
particular, the orthogonality of y,, with different s and the same k implies 
that the functions u, are orthogonal. Because they are periodic, it is sufficient 
to integrate over the volume v of one lattice cell; with the appropriate normali- 


zation, 
f užu dv = dye". (55.4) 


The significance of the vector k is that it determines the behaviour of the 
wave function under translation: the transformation r — r+a multiplies it 
by eka, 

Yalt +a) = dyar). (55.5) 


Hence it follows immediately that kis by definition not unique: values differ- 
ing by any vector b of the reciprocal lattice give the same value of the wave 
function; the factor exp {i(k+b).a} = exp (ik.a). That is, such values of k 
are physically equivalent; they correspond to the same state of the electron, 
i.e. to the same wave function. We can say that the functions y,, are periodic 
(in the reciprocal lattice) with respect to the suffix k: 


Ws, k+b(F) = Ysk(T). (55.6) 


The energy also is periodic: 
es(k +b) = e,(k). (55.7) 


The functions (55.2) have a certain similarity to the wave functions of a free 
electron, the plane waves y = constant x exp (ip.r/fi), the conserved momen- 
tum being replaced by the constant vector Ak. We again arrive, as with the 
phonon (see Part 1, §71), at the concept of the quasi-momentum of an electron 
in a periodic field. [t must be emphasized that in this case there is no actual 
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conserved momentum, since in the external field there is no law of conserva- 
tion of momentum. It is noteworthy, however, that in a periodic field an elec- 
tron is still characterized by a certain constant vector. 

In a stationary state with a given quasi-momentum Ak, the actual momentum 
can have, with various probabilities, an infinity of values, of the form A(k +b). 
This follows because the Fourier-series expansion of a function periodic in 
space contains terms e/?*: 


usht) = Y ds, k4p cP, 
b 


and the expansion of the wave function (55.2) in plane waves is therefore 


PoklF) = F ds, pp CEt, (55.8) 
b 


The fact that the expansion coefficients depend only on the sums k +b expresses 
the property of periodicity in the reciprocal lattice (55.6). We must emphasize 
that this fact, like the property (55.6), is not an extra condition imposed on the 
wave function, but follows necessarily from the periodicity of the field U(r). 

All physically different values of the vector k lie in one unit cell of the recip- 
rocal lattice. The “volume” of this cell is (27)?/v, where vis the volume of the 
unit cell of the crystal lattice itself. On the other hand, the volume in (k/2z)- 
space determines the number of corresponding states (per unit volume of the 
body). Thus the number of such states in each energy band is 1/v, i. e. the num- 
ber of unit cells per unit volume of the crystal. 

In addition to being periodic in k-space, the functions ¢,(k) are symmetrical 
under the rotations and reflections that correspond to the symmetry of direc- 
tions (the crystal class) of the lattice. Independently of the presence or absence 
of a centre of symmetry in the crystal class concerned, we always have 


e,(—k) = «,(k). (55.9) 


This property is a consequence of the symmetry under time reversal: because 
of the symmetry, if pg, is the wave function of a stationary state of the electron, 
the complex conjugate function y% also describes a state with the same energy. 
But y% is multiplied by e~** on translation, i.e. corresponds to a quasi- 
momentum —k.' 

Next, let us consider two electrons in a periodic field. Regarding them to- 
gether as one system with the wave function y(r, r2), we find that-this function 
must be multiplied by a factor e** on translation, where k may be called the 
quasi-momentum of the system. On the other hand, for large distances between 


t In the presence of overlapping bands, it follows from these considerations, strictly speak- 
ing, only that ¢,(—k) = ¢,(k), where s and s’ are the numbers of certain bands. Equation 
(55.9) can, however, always be satisfied by appropriately numbering the various branches of 
the function e(k). 
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the electrons »(r1, re) reduces to the product of the wave functions of the indi- 
vidual electrons, and is multiplied by ee" on translation. Since the two 
forms of the factor must be same, we find that 


k = kı +k +b. (55.10) 


In particular, it follows from this that, in a collision of two electrons moving 
in a periodic field, the sum of their quasi-momenta is conserved to within a 
reciprocal lattice vector: 


ki tke = ki +k; +b. (55.11) 


A further analogy between the quasi-momentum and the actual momentum 
is shown by determining the mean velocity of the electron. The calculation 
requires a knowledge of the velocity operator ¢ = Ê in the k representation. 
The operators in this representation act on the coefficients c,, in the expansion 
of an arbitrary wave function in terms of the eigenfunctions y,, : 


Y= F, f E&Y Pk. (55.12) 


Let us first find the operator f. We have identically 


Vee f Esty Bk 


= = 5 | ea Ck (| —i oi + ieik-r oe) ak, 








ok 


In the first term we integrate by parts, and in the second we expand the func- 
tion Ou,,/0k, which is periodic (like u), in terms of the mutually orthogonal 
functions uy, with the same k: 


Ous/OK = —i F (sk| Q| sk) usk, (55.13) 


where (sk | @| s’k) are constant coefficients. We then find 





ry = 2 | iva IW ok a Bk 4 2 Csu(Sk | Q| s'K) Woy d'k 


gji 


From the definition of the operator f, we must have 
rp = Y f (Pes) Pa dk. 
S 





; OCs E+E OID sk) cr] Psk Bk. 


Comparison with the above expression gives 


+6, (55.14) 


= 


m) 
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where the Hermitian operator Ĝis specified by its matrix (s’k | Q | sk). It is im- 
portant to note that this matrix is diagonal ink, and the operator ¢2 therefore 
commutes with the operator k = k. 

The velocity operator is found, according to the general rules, by commuting 
the operator f with the Hamiltonian of the electron. In the k representation, 
the Hamiltonian A is a diagonal matrix (with respect to k and with respect 
to the band numbers s) with elements ¢,(k).' The operator 0/0k, which acts 
only on the variable k, is diagonal with respect to s. Hence, in the expression 


= Z (m-th) 


1 © ð a 
the first {erm is a diagonal matrix with elements 


1 a a 1 e,(k) 
-g (eoa ee) = HF GE 


The matrix elements of 92 are related to those of Q by 





(sk | Q! s'k) = (i/h) [es(k)— ev(k)] (sk | Q | s’k); 


this is zero when s = s’, i. e. & has no elements diagonal with respect to the 
band number. Thus we have finally for the matrix elements of the electron 
velocity 


(sk |v| sk) = ðes(k)/ðk, (sk |v!s’k) = (sk|Q's'k) (s= s’). (55.15) 


The diagonal elements of this matrix are the mean values of the velocity in 
the corresponding states. These values are therefore given as functions of the 
quasi-momentum by 

V, = 0e,(k)/A Ok, (55.16) 


which is entirely analogous to the usual classical relation. 

So far the discussion has ignored the existence of the electron spin. When 
relativistic effects (the spin-orbit interaction) are neglected, the inclusion of the 
Spin simply causes a twofold degeneracy of each energy level with a given value 
of the quasi-momentum k, there being two values of the spin projection on any 
fixed direction in space. When the spin-orbit interaction is taken into account, 
the situation differs according as the crystal lattice does or does not have a 
centre of inversion. 


t More precisely, it is the ks representation. We may recall that the wave functions Cy in 
this representation are not completely arbitrary, but must be periodic in k. 
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The spin-orbit interaction for an electron in a periodic field is described by 
the operator 
i 2 


1 = Fme X VUV: (55.17) 


A, 
where o is the Pauli matrix vector (see ROT, §33). The wave functions on which 
this operator acts are spinors Ya of rank one, where « is the spinor index. 
According to Kramers’ theorem (see QM, §60), which relates to any electric 
field (periodic or not), the complex conjugate spinors Ya and Ypa always 
describe two different states with the same energy. Since also the function Y% 


corresponds to the quasi-momentum —k, we again obtain (and now with 
inclusion of the spin-orbit interaction) a relation of the type (55.9): 


€so( —K) = &s(k), (55.18) 


where o and o’ denote two different (time-reversed) spin states. 

Equation (55.18) does not, of course, imply degeneracy in the sense consid- 
ered above, since the energies on the two sides of the equation refer to differ- 
ent values of k. If the lattice has a centre of inversion, however, the states k 
and —k have the same energy. We then get £, (K) = ¢,,(k), which again sig- 
nifies double degeneracy of each level with a given quasi-momentum. 

As well as the degeneracy due to the symmetry under time reversal, for an 
electron in a periodic field there can also be degeneracy due to the spatial 
symmetry of the lattice. This is dealt with in §68. 


PROBLEMS 


PROBLEM 1. Find the dispersion relation for one-dimensional motion of an electron in the 
periodic field shown in Fig. 10 (R. de L. Kronig and W. G. Penney 1930). 





Fic. 10. 


SOLUTION. The wave function in the well region I (0 < x < a) is of the form 
y = ce" tee", x, = 4/(2me)/A, (1) 
and in the barrier region II (— b < x < 0) 
y = cae” + ce", xg = +/[2m(e— Up)/Al- (2) 
t When the spin-orbit interaction is taken into account, the spin projection operator does 


not commute with the Hamiltonian; the spin projection is therefore not conserved, and the 
spin states cannot, strictly speaking, be characterized by this number. 
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In the next barrier region IHI, the wave function can differ from (2) only by the phase factor 
ef(o+>), a+ b being the period of the field: 


y= eth tee" y e eT), (3) 


The conditions of continuity of y and y’ at x=0 and x=a give four equations for cy, ..., C45 
the condition for these equations to be compatible leads to the dispersion relation 


cos k(a+ b) = cos #3a.cos #2b— 5 (2 +") sin x,a.sin xab, (4) 
1 2 


which implicitly determines the required relation e(k). For e < Up, x2 is imaginary, and the 
equation must then be written 

cos k(a+ b) = cos x,a cosh |,5| gal (2! — a) sin x,a sinh | #25}. (5) 

2\ x4, |X| 
If we take in (5) the limit Up — œ, b — O with Ub = constant = Pa, we obtain the disper 
sion relation 
fo 
Pma sin x,a , (6) 


cos ka = cos x3a+—— 
h? %14a 





This solves the problem of energy levels in a periodic field composed of delta-function peaks: 
U(x) = aP È òl(x—an). 
n 
Figure 11 shows a graphical plot of the distribution of roots of equation (6). The right-hand 


side is represented as a function of x,a; when it takes values between +1, the roots of the 
equation take values in the ranges shown by the thick segments of the horizontal axis. 
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PROBLEM 2. Find the dispersion relation for one-dimensional motion of a particle in a weak 
periodic field U(x). 


SOLUTION. Regarding the field as a small perturbation, we start from the zero-order 
approximation, in which the particle is in free motion described by a plane wave 


yo (x) = ( Na)7 2 etkz 


(with normalization to one particle in the length Na, where a is the period of the field); the 
Particle energy e = #?k*/2m. We write the periodic function U(x) as a Fourier series: 


Ux)= Yi Upele, 
n= — oo 
The matrix elements of this field with respect to the plane waves are zero except for transitions 
between states with wave numbers k and k’ = k+2z2n/a; for these, they are Uy, = Un. 
In first-order perturbation theory, the correction to the energy is given by the diagonal 
matrix element « = Uy, = Up, a constant independent of k, which merely shifts the origin 
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of energy. An exception occurs, however, for the energy levels near k = anfa (n= +1, 
+2,...). At these points, k differs only in sign from k’ = k—2nn/a,so that the energies e(k) 
and €)(k’) are equal. Near these values, therefore, the matrix elements are non-zero for transi- 
tions between states with similar energies, and to find the correction we must use perturbation 
theory as applied to the case of close eigenvalues (see QM, §79). The answer is given by QM 
(79.4), according to which we have in the present case 


elk) = Hek) + eOk- K,) + 
+ Hel(K)— EAk- Ka] IUl, 


where K, = 27n/a, and the additive constant U is omitted; the choice of the sign of the square 
root is determined by the requirement that, far from the value k = ++4K,, the function e(k) 
becomes ¢")(k): the signs + and — refer respectively to the ranges | k| > |4K,| and |k!< |$K,|. 
At the points k = +4XK, themselves, the function e(k) has a discontinuity of 2! U,,|. In Fig. 
12a the energy ¢(k) is shown as a function of the variable k, which runs from — æ to oo. If k, 
the quasi-momentum,, is restricted to the range between +7x/a, we have Fig. 12b, which shows 
the first two energy bands. 





It should be noted that the bands in Fig. 12, as in Fig. 11, do not overlap. This is a general 
property of one-dimensional motion in a periodic field. Each energy level is doubly degenerate 
{with respect to the sign of k),and no higher degree of degeneracy is possible in one-dimension- 
al motion. Moreover, in the one-dimensional case the limits of each band, the minimum and 
maximum values of e(k), correspond to k = 0 and k= n/a. This is because the wave functions 
corresponding to energies in the forbidden range are multiplied by some real factor upon dis- 
placement through the period a, and therefore increase without limit at infinity. The wave 
functions in the allowed energy ranges are multiplied by e° upon such a displacement. At the 
boundary between the allowed and forbidden ranges, this factor must consequently be a 
quantity both real and of unit modulus, whence it follows that ka = 0 or z. 


PROBLEM 3. Find the dispersion relation for a particle in a one-dimensional periodic field 
consisting of a sequence of symmetrical potential wells satisfying the condition of quasi-clas- 
sicality (so that the probability that the particle penetrates the barrier between the wells is 
small). 


SOLUTION. This is similar to the solution of the problem of level-splitting in a double well 
(QM §50, Problem 3). Let wo(x) be the normalized wave function describing the motion 
(with a certain energy £o, Fig. 13) in one of the wells, i.e. decaying exponentially beyond the 


U(x) 
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limits of this well on either side; it may be either an odd or an even function of x. The correct 
wave function in the zero-order approximation for the motion of a particle in a periodic field 
is the sum 


vio =C F ey (x—an), (1) 


a= -0 


where C is a normalization constant; the displacement x x+a multiplies this function by 
e**s as it should. 
Schrédinger’s equations are 

py + (2m/h*)fe(k) — U(x)] Y = 0, 

Yo +(2m/h*)[eo— U(x)] Yo = 0; 
we multiply the former by Y9, the latter by y+, subtract corresponding terms, and integrate with 
respect to x from —4a to 4a(Fig. 13). Since the products yo(x)yo(x— an) with n = 0 are 
everywhere negligible, we have 

a/2 


f vel) polx) dx  C. 
—e/2 


The result is 
e(k)— €(0) = (A?/2mC) [viye—vovil”, e 


At x = ła, only the terms with n = 0 and 1 need be retained in the sum (1); yo(—}@) = 
+ po(} a), according as the function yo(x) is even or odd: 


vu Za) = Cy(F a) (Ite), 
y, (Fa) = Cyo(} a) (1 Fe); 
similarly, when x = — ła, only the terms with n = 0 and —1 need be retained. The result is 
€(k)— £o = +(2h?/m) Yol a) yol a) cos ka. 


Here we must substitute 


| J 
po(ża) = al ° / ol- Í oc , 


mee 


yoda) = PED G a), 


where w is the classical frequency of oscillations of the particle in the well, and xo is the turning 
point, corresponding to the energy £o. The final result is 
e(k)— £ = + (w/z) VD cos ka, 


af/2 
D = exp |-4 Í eae 


To 


Thus each energy level £, for the motion of a particle in an isolated well is broadened into a 
band with small width 2w D/z, dependent on the transmission coefficient of the potential 
barrier between the two wells. 
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§ 56. Effect of an external field on electron motion in a lattice 


Let us consider the motion of an electron when a constant magnetic field 
H is applied to the lattice. If we start from the Hamiltonian of an electron in a 
periodic field U(r) in the coordinate representation, 


A = p?/2m+ U(r), (56.1) 


where Ð = — ihv is the operator of the actual momentum, the external mag- 
netic field is introduced in the usual way: 


p= = (e—<a) + U(r), (56.2) 


where A(r) is the vector potential of the field. The problem is, however, great- 
ly simplified (in the case of a sufficiently weak field) by changing to the quasi- 
momentum representation. 

Because of the great variety of the possible forms of the band structure in 
the energy spectrum of an electron in a lattice, the condition for the external 
field to be small can be generally formulated only in a fairly crude way. Let 
the electron be in some sth band before the field is applied. Let £o denote the 
smallest of the energy characteristics for this band, such as its characteristic 
width, or the distance to the neighbouring bands, i. e. the difference e (k)— 
—e,(k) for given k. If the magnetic field may be supposed weak, the following 
condition must certainly be satisfied : 


hwg «< E0, (56.3) 


where the “Larmor frequency” wyg ~ |e'H/m"c,and m” ~ hk/v is the effec- 
tive mass of the electron.? 

In the absence of an external field, the Hamiltonian of the electron in the 
lattice in the k representation is, as already mentioned, a diagonal matrix with 
elements ¢,(k). In the presence of the field, the Hamiltonian will also contain 
the potential A(r) and its derivatives with respect to the coordinates, the field 
H (and in a non-uniform field also the derivatives of the field); in the k rep- 
resentation, the function A(r)is replaced by the operator A = A(f), where f 
is the operator (55.14). 

The potential A(r) is an increasing (for a uniform field, linearly increasing) 
function of the coordinates. Because of this increase, the potential is not, even 
for a weak field, a small perturbation in the Hamiltonian of an infinite system 
(an electron in a lattice). For this reason, even a weak magnetic field consid- 
erably alters the properties of a large system, making the spectrum discrete 


t A more precise definition of the frequency is given in (57.7) below. For conduction elec- 
trons in a metal (see §61), the characteristic values of k ~ 1/a (where a is the lattice constant); 
putting also £ ~ h?/m*a*, we find that the condition (56.3) is equivalent to rg >> a, where the 
“orbit radius” rg ~ v/wy. 


§ 56 Effect of an External Field on Electron Motion in a Lattice 233 


instead of continuous; it quantizes the levels (see §58). The weak field intensity, 
in contrast to the potential, gives only small corrections. 

We shall show that, when these corrections are neglected, the dependence of 
the Hamiltonian on the field potential may be ascertained in a general form by 
using only the requirements of gauge invariance. Since we are considering con- 
stant fields, it is sufficient to use the invariance of the equations under time-inde- 
pendent transformations of the potential and the wave functions, of the form 


A>+A+Vf, yy exp (ief/hc), (56.4) 


where f(r) is an arbitrary function of the coordinates; see QM, (111.8), (111.9). 
In a weak field, the potential A(r) is a slowly varying function of the coordi- 
nates. In order to elucidate the significance of this slowness, let us first consider 
the limiting case of a constant potential, A(r) = constant = Ao (the constant 
potential is of course fictitious, there being then no actual field, so that a formal 
transformation is in question). The change from A = Oto A = Ao is equivalent 
to the transformation (56.4) with f = Ao.r; hence, instead of the original eigen- 

functions (for A = 0) 
Yu = Usk ou (56.5) 


the eigenfunctions of the new Hamiltonian will be 
Usk (r) exp f: (x + Ao) xl 


From this we see that, in order to give the quasi-momentum its previous signifi- 
cance as a quantity determining the change in the phase of the wave function 
in translations, we must put k+eAo/fic = K; the quantity K thus defined may 
be called the generalized quasi-momentum. Then the new eigenfunctions are 


Ysk = Us, K—eAo/ħc(T) eiK.r, 
and the corresponding electron energy values are ¢,(k) = ¢,(K—eAo/fc). 

We can now assert that, for a potential A(r) that is not constant but varies 
slowly in space, the wave functions of the “zero-order” approximation (with 
respect to the field) are 

Ysk = Us, K—eA(r)/Ac e Rare (56.6) 


the functions u are no longer strictly periodic, because of the variability of 
A." The energies ¢,(K —eA/fc) must now be regarded as operators forming the 


t Jf the functions (56.6) are expanced in terms of the y,,, the expansion will in general in- 
clude functions with various s. It must be emphasized, however, that this does not signify an 
actual transition to a different band, but simply expresses the change in the wave function un- 
der the action of the constant field. In this connection it may be recalled that a constant field 
cannot cause an actual transition with change of energy. To understand the position, it should 
be remarked that, although the field is weak, the resulting change in the classification of states 
(including the relation between quasi-momentum and energy) is considerable. 
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Hamiltonian in the K representation. In this representation, moreover, Î is to 
be understood as the operator f = i0/@K, the second term (£2) in the definition 
(55.14) being omitted. For, when the operator i0/0K acts on the wave function, 
the effect, in order of magnitude, is to multiply it by the “orbit dimension” ry, 
which increases as the field decreases; the result of the action of the operator 2 
on the wave function does not contain such an increasing factor. In this sense, 
© in a weak field is small compared with i0/OK. Since, on the other hand, the 
operator 0/@K is diagonal with respect to the band numbers, the Hamiltonian 
also is diagonal. 

We thus conclude that the motion of an electron in a weak magnetic field is 
described by the Hamiltonian (in the K representation) 


me 


K (56.7) 


Ĥ, = & (x-4), =i 
(R. E. Peierls 1933). In this approximation, therefore, there is a complete anal- 
ogy with the introduction of the magnetic field in the Hamiltonian of a free 
particle in the momentum representation. 

The expression (56.7) is still not fully determinate, as the order of action of the 
non-commuting operators (the components of the vector k = K—eA/fic) has 
not been specified. This must be determined in such a way as to make the Hamil- 
tonian Hermitian, which can in principle always be achieved by expressing the 
function «,(k), which is periodic (in the reciprocal lattice) as a Fourier series 


elk) = } As e™:*; (56.8) 


the summation is over all vectors a in the original lattice. After the change 
k — k, the exponent of each term in this series will contain only one operator 
(the projection of the vector A on a), so that the question of the order of action 
does not arise, everything reducing to powers of this one operator. This method 
of “Hermitianization” is, of course, not the only one, but it is important that 
the difference between the different methods lies beyond the range of the approx- 
imation considered, since the commutators of the operators k,, k,, k, are small 
quantities in this approximation. For instance, in a uniform field the operator 


A =+H xf = +iHxX0/OK; (56.9) 
a direct calculation easily shows that the commutators 
kxk, — kyk, =i(e/hc) Hz, ..., (56.10) 


are proportional to the low field strength H. 


The operators f = /0/0K and K=K obey the same commutation rules as the 
coordinates and generalized momenta of a “free” particle (without the lattice). 
It is therefore natural that the calculation of the commutators of these operators 
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with the Hamiltonian gives the operator equations 
kK = —əĤ/ðr, Ê= 0A/KOK, (56.11) 


which have the form of the usual Hamilton’s equations; for the calculation 
see QM (16.4) and (16.5). 

We repeat that the Hamiltonian (56.7) is approximate in the sense that it 
omits all terms depending on the field H and not containing large factors of the 
order of the orbit size ry. In subsequent approximations, the result may again 
be expressed as an effective Hamiltonian 4,(K—eA/fc, H), diagonal with 
respect to band numbers but not expressible in terms of the functions e,(k) 
only." 

If the spin-orbit interaction is neglected, the inclusion of the electron spin 
causes the Hamiltonian to contain the usual term describing the interaction 
of the magnetic moment with the field: —8o.H, where o are the Pauli matrices 
and B = ;e;#/2mc is the Bohr magneton. If the crystal has a centre of inversion, 
the spin-orbit interaction simply changes the magnetic moment of the electron, 
so that the interaction of the spin with the magnetic field becomes 


— Bo;Hy&ix(k). (56.12) 


For in this case the Hamiltonian must be invariant under the simultaneous 
operations of time reversal and inversion. The transformation must be accompa- 
nied by the changes H+—H and o-—o with k unchanged; (56.12) is the 
most general expression satisfying the given condition. The tensor &;(k) can- 
not, of course, be calculated in a general form. 

Lastly, let us consider the behaviour of the electron when a weak electric 
field E is applied to the lattice. The weakness condition is that the energy ac- 
quired by the electron in the field over a distance ~ a is small compared with 
the characteristic energy £o: | elEa<« co. 

As with a magnetic field, the most important terms are those which contain 
an increasing function of the coordinates, the scalar potential (r) of the electric 
field. The dependence of the Hamiltonian on ¢ can again be found in a general 
form by arguments similar to those used above. The application of a fictitious 
constant potential @ = ġo is equivalent in Schrédinger’s equation to the inclu- 
sion in the energy of a constant term edo; such a term is also added to each of 
the eigenvalues e (k). When the potential ¢(r) is not constant but varies slowly 
in space, a corresponding operator term is added to the effective Hamiltonian 
in the k representation: 

HA, = ¢(k)+e@(f). (56.13) 


t A simple example of the calculation of the correction term will be given in §59. An 
account of the regular method of deriving the Hamiltonian as a series in powers of H, and 
general expressions for the first few terms in this series, are given by E. I. Blount, Physical 
Review 126, 1636, 1962; Solid State Physics 13, 305, 1962. If the crystal has a centre of 
inversion, the series begins with terms of order H? (see £59). 
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§ 57. Quasi-classical trajectories 


Let us apply the results of §56 to the important case where the electron motion 
in the magnetic field is quasi-classical. The condition for this is, as we know, 
that the de Broglie wavelength of the particle varies only slightly over distances 
of the order of itself. In the present case, this condition is equivalent to the 
inequality 

rœ À: (57.1) 


the radius of curvature of the orbit is large in comparison with the wavelength 
A ~ 1/k.t 

In the quasi-classical case, the particle trajectory is a meaningful concept. 
This is determined by the equations of motion, which are obtained from (56.11) 
by replacing the operators by the corresponding classical quantities : 


AK = —ôH/ðr, v=0H/hOK, H=e (x -7 40) 


the suffix s is omitted, for brevity. We expand these equations by changing from 
the generalized quasi-momentum K to the “kinetic quasi-momentum” 
k = K—eA(r)/fic. 
We have 
dk e dA) OH _ e OA; 


t e d oe eee 





Writing here dA/dt = (v. V) A and noting that 
(v7) A;—(v. V7) A = vXcurl A = vxH, 


we obtain the equation of motion 


dk e 
h-i = 7 VXH, 


_ 9e(k) 


= hok- ° (5 7.2) 


This equation differs from the ordinary classical Lorentz equation only as 
regards the function e(k), which instead of a simple quadratic function is a 
complicated periodic function, and the function v(k) is therefore also of this 
kind. There is consequently, of course, a considerable change in the motion 
of the electron. 

Let us consider this motion in a uniform magnetic field. Multiplying equation 
(57.2) by v, we find in the usual way that fiv.dk/dt = de/dt = 0. Multiplying 


t This condition is in general stronger than (56.3). But, if k ~ 1/a, as occurs for conduction 
electrons in a metal, the two conditions coincide and are in practice always satisfied: for 
rq ~ chk/\|e|H ~ ch/a\e|H, the condition rg > a leads to H « ch/|e| a® ~ 108— 10° Oe. 
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(57.2) by H gives d(H.k)/dt = 0. Thus for the motion of the electron in the 
lattice, as in the motion of a free electron in a magnetic field, 


€ = constant, k, = constant, (57.3) 


the z-axis being in the direction of the field H. The equations (57.3) define the 
path of the electron in k-space. Geometrically, it is the outline of a cross-sec- 
tion of the constant-energy surface e(k) = constant by a plane perpendicular to 
the magnetic field. 

The constant-energy surfaces may have many different forms. They may 
include (in each cell of the reciprocal lattice) several disconnected sheets, which 
may be singly or multiply connected, open or closed. To illustrate the latter 
difference, it is useful to consider a constant-energy surface which continues 
periodically throughout the reciprocal lattice. In each cell, there will be equal 
closed cavities; the open surfaces pass continuously through the whole lattice 
to infinity." 

The cross-sections of the constant-energy surface are made up of an infinity 
of contours. These include both cross-section contours of different sheets of the 
constant-energy surface within one reciprocal lattice cell and those of sheets 
repeated in different cells. If a sheet of the constant-energy surface is closed, all 
its cross-sections are closed curves. If it is open, its cross-sections may be either 
closed, or open (i.e. continuous throughout the reciprocal lattice). 

The quasi-classicality of the motion also presupposes the smallness of the 
probability of magnetic breakdown, i.e. a discontinuous change in the electron 
quasi-momentum as it passes from one contour to another; we shall give the 
condition for this smallness at the end of the section. If this probability is neg- 
lected, therefore, the electron moves only along one contour of the constant- 
energy surface. 

Let us consider in more detail the motion along closed trajectories in quasi- 
momentum space. It is evident that such a motion is periodic in time, and we 
can determine its period as follows. Taking the projection of equation (57.2) on 
the k,k,-plane perpendicular to the field, we obtain 

dl;, _ le| H 


ae ch vi, Vy = JS(vi+v), 


where dl, = +/(dk‘,+ dk?) is an element of length of the k orbit. Hence 


ch f dl 
jelH | v` 





i= 


t To avoid misunderstanding, we should mention that it may be impossible to choose the 
reciprocal lattice cell in such a way that all the essentially different (i.e. not periodic repetitions) 
closed cavities are within one cell and not cut by its faces, 
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If the trajectory is closed, the period of the motion is given by the integral 


ial (57.4) 





taken along the whole contour. This expression may now be converted into 
a more obviously significant form. 

We use the area S(e, k,) of the cross-section of the constant-energy surface 
€ = constant by the plane k, = constant. The width of the annulus in this plane 
between the contours e = constant and e+de = constant is at each point 


de = de 
\Oe/Ok,| v,’ 
so that the area of the annulus is 
dS = de $ dl,/fiv .. 


Hence we see that the integral in (57.4) is just the partial derivative 0S/0e. Thus 
the period of the motion is 


ch? = OS(e, kz) 


= Ter H Os (57.5) 
(W. Shockley 1950). Here it is natural to introduce the quantity 
m* = (h?/22) OS/0e, (57.6) 


called the cyclotron mass of the electron in the lattice. The orbital revolution 
frequency of the electron is expressed in terms of this quantity by 


wa = le! H/m*c, (57.7) 


which differs from the usual formula for the Larmor frequency of free electrons 
in that their mass is replaced by m*.t 

We must emphasize, however, that for electrons in a lattice the cyclotron mass 
is not a constant, but a function of € and k,, and is therefore different for differ- 
ent electrons. Moreover, it may be either positive or negative, the electron 
moving in its orbit as a negatively charged particle or positively charged hole 
respectively. Accordingly we speak of electron and hole trajectories. 

Hitherto we have discussed the trajectory of an electron in k-space. It is easy 
to see, however, that there is a close relationship between the trajectories in 
quasi-momentum space and ordinary space. The equation of motion (57.2), 


written in the form 
dk = —'e'drXH/c, 


t For a free electron, the constant-energy surface is the sphere e = f°4°/2m. Its cross-sec- 
tions are circles with area S = 2(2me/h?—k?), and so the derivative 0S/de = 22m/h*, and 
m* = m. 
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gives on integration (with the appropriate choice of the origin of the coordinates 
r and the quasi-momenta k) 


Ak = — 'eirxH/c. (57.8) 


From this wesee that the xy-projection of the orbit in ordinary space essentially 
repeats the k trajectory, differing from it only in orientation and scale: it is 
obtained from the k trajectory by the changes 


kx — —!e| Hylhc, k, + !e! Hx/he. 


Moreover, in ordinary space there is a motion along the z-axis with velocity 
v, = ĝe/hðk,. If the trajectory in k-space is closed, in ordinary space it is a 
helix with the axis along the field. If the trajectory is open, so is its projection on 
the xy-plane in ordinary space, and the motion in this plane is therefore in- 
finite. 

Some comments may be added about the quasi-classical motion of an elec- 
tron when a constant and uniform electric field E is applied to the lattice. 
The quasi-classical equation Ak = eE gives 


k = ky +eEt/h. (57.9) 
The law of conservation of energy gives 
€(k)—eE.r = constant. (57.10) 


But the energy e(k) takes values in a finite range 4e (the band width); it there- 
fore follows from (57.10) that the motion of an electron in a uniform electric 
field is finite in the direction of the field: the electron oscillates in that direction 
with amplitude 4e/| e! E. If the field is parallel to any basic vector b of the re- 
ciprocal lattice, the motion is periodic with frequency œw = 27! e'E/fib; when 
b ~ 1/a, we have iw, ~ e] Ea. In the general case of an arbitrary direction of 
the field, the motion is quasi-periodic. 

Lastly, let us consider the condition for the magnetic breakdown described 
above to be negligible. The probability of transition from one trajectory (in 
k-space) to another is, of course, large if these trajectories anywhere pass 
unusually close to each other. Such a situation arises when the trajectory is close 
to a self-intersecting one, or if it passes near the intersection of two sheets of the 
constant-energy surface (i.e. near a point of degeneracy). Figure 14 shows a 
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typical pattern of trajectories in such cases; the gap ôk between the trajectories is 
small compared with the characteristic dimensions of the orbits as a whole, and 
the radius of curvature R, of the trajectories near their points of closest ap- 
proach is in general of the order of 5k. The transition from one orbit to the other 
takes place by quantum tunnelling. The probability of this process is (exponen- 
tially) small if 6k is large compared with the distance 4k, over which the wave 
function decays in the classically inaccessible region between the trajectories. 

An estimate of 4k, can be obtained by using the analogy between the motion 
of the electron in the magnetic field and a one-dimensional motion in a poten- 
tial field U(x). This analogy is based on the fact that, according to (56.10), the 
operators ĝ = k,hc/|e| H and ĵ = hk, obey a commutation rule that is the 
same as for the coordinate and momentum. Near the points of closest approach, 
the trajectories are parabolic, like the (x, p) phase trajectory of one-dimensional! 
motion in a uniform field (U = — Fx), whose equation is p?/2m = Fx if the 
coordinate x is measured from the turning point. In the latter case, the wave 
function decays over a distance Ax ~ (h?/mF)'* beyond the turning point (see 
OM, §24); with the radius of curvature of the phase trajectory R ~ (d?x/dp”)™} 
~ mF, we find 1x ~ (#?/R)'*. By the analogy mentioned, the 4k, sought can 
be found by making the changes Ax — ficAk,/|e| H, R — R,h|e| H/c. Thus we 
have lk, ~ (|e) H/fc)’* (ôk), and the condition Ak, < ôk becomes 


lel H/fic « (6k)?. (57.11) 


§ 58. Quasi-classical energy levels 


We have seen that classical motion of an electron in a lattice in a magnetic 
field, along a closed trajectory in k-space, corresponds in ordinary space to a 
motion that is finite in a plane perpendicular to the direction of the field H. 
When we change to quantum mechanics, there are discrete energy levels for 
any fixed value of the longitudinal quasi-momentum k,. These are determined 
by the general rules of quasi-classical quantization. 

We choose the vector potential of the uniform magnetic field (along the 
z-axis) in the form A, = — Hy, A, = A, = 0. Then the components of the gen- 
eralized quasi-momentum are 


K; = kx+le|Hylch, K, =k, K= k.. (58.1) 


The coordinate x is a cyclic variable, and the x-component of the generalized 
quasi-momentum is therefore conserved: 


K, = k+ |e] Hy/ch = constant. (58.2) 


According to Bohr and Sommerfeld’s quantization rule (see QM, §48), we 
can write the condition 
1 ! 
or $ K, dy! = N, (58.3) 
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where the integration is taken over the period of the motion, and n is a positive 
integer, assumed large.‘ Substituting here from (58.1) and (58.2) K, = k, and 
dy = —(ch/e| H |) dk, we obtain 


es cae Se ee (58.4) 
oe Te] H | $ Y *| ’ . 
where the integral is now taken along a closed trajectory in k-space. This in- 
tegral is just the area enclosed by the trajectory, i.e. the area S(e, k,) of the cross- 
section of the constant-energy surface by the plane k, = constant, as defined 
in §57. 
Thus we have finally 


S(e, kz) = 2n('e! H/ch)n (58.5) 


(I. M. Lifshitz 1951, L. Onsager 1952). This condition implicitly determines 
the energy levels ¢,(k,). Thus the energy band (whose number s is omitted, for 
brevity) consists of a discrete set of Landau sub-bands, each of which is a range 
of energy levels distinguished by the value of the continuous variable k,. 

The quasi-classical quantization condition can be refined by including a cor- 
rection which amounts to adding a number of the order of unity to the large 
quantum number n. The determination of this correction involves a considera- 
tion of the motion near the “turning points” which form the limits of the range 
of integration in (58.3). 

The dependence of K,=k, on yon the electron trajectory is determined by the 
equation 


e(k) = (x. a a 








ky, ke) = constant (58.6) 





for given k, and with K, = constant; the turning point y = yo is determined 
by the condition for the velocity v, = 0e/hOk, to be zero. Near this point, an 
expansion of (58.6) in powers of y — yo gives 


E (E) om (at 


where k = ko). Hence we see that the turning point is approached accord- 
Ing to a square-root law: 





ky — ky = tAV(Y—Jo); 


t For motion in a uniform magnetic field, the integral oK,. dr/27 is an adiabatic invariant 


independent of the choice of the vector potential, K, being the projection of the generalized 
quasi-momentum on the plane perpendicular to the field; cf. Frelds,§21. With our choice of A, 


the integral oK,dx = K, „$dx = 0, so that the adiabatic invariant is the same as the integral 
in (58.3). 
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we take the particular case where the classically inaccessible region is in y < yo. 
This law is the same as the one covered by the usual derivation of the correction 
to the quasi-classical quantization (see QM §§47 and 48). The corrected rule 
(58.5) is consequently 


Sle, kz) = 2n(\e| H/ch)(n+4). (58.7) 


As is clear from the derivation, which is based on the expansion of the func- 
tion (58.6), it is necessary for the validity of the corrected quantization rule 
that the trajectory should pass sufficiently far from the singular points of the 
function e(k) (including the complex branch-points). It is also necessary that 
the quasi-classicality condition should be satisfied at all points near the tra- 
jectory, and in particular that the xy-projection of the velocity 0e/0k should 
not be zero.’ Lastly, it must be remembered that the Hamiltonian (56.7), on 
which the whole of the derivation is based, is itself approximate. If the lattice 
has a centre of inversion, the corrections to the Hamiltonian are quadratic in 
the field, and do not affect the condition (58.7), but if there is no centre of 
inversion they are linear in H; in the latter case the correction term 4 in (58.7) 
is meaningless, since the approximate nature of the Hamiltonian implies an 
error of the same order.* 

The interval Ae between two successive levels corresponds to a change of 
unity in the large number n. It is therefore determined by 


AS = (0S/de) Ae = 27 |e | H/hc. (58.8) 
With the classical frequency wy of periodic motion from (57.7), we have 


Ae = hog. (58.9) 


It must be emphasized that the frequency wy is itself a function of e (and of k,). 
Hence the successive energy levels e, (for given k,) are not strictly equidistant 
as in the case of free electrons (where wy is a constant). 

The fact that the energy levels are independent of the conserved quantity K, 
means that they are degenerate (as are those of free electrons in a magnetic 
field; see OM §112). If we imagine a lattice with a large but finite volume V, 
the degree of degeneracy is finite. The number of states in the range dk, with 
a given value of n is VAS.dk,/(2x)°, where AS is the area in the plane k, = 
= constant between the trajectories with quantum numbers n and n+1. This 


t Near points of anomalous approach of two trajectories, these conditions coincide with 
the requirement that the probability of magnetic breakdown is small. 
+ For free electrons (see the last footnote to §57), the condition (58.7) gives 


e = hog(n+3)+#k2/2m, wp = le! H/me 


in agreement with the familiar Landau formula fora free electron in a magnetic field (QM, 
§112). 
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area is given by (58.8), and we thus find for the required number of states 


Vdk, \e|H 
nyt ch (58.10) 








the same as in the case of free electrons. 

The intuitive reason for the degeneracy of levels in a magnetic field is that the 
energy is independent of the spatial position of the centre of the Larmor orbit 
of the electron. For a free electron, the degeneracy is exact, but for an electron 
in a lattice it may be only approximate: because of the presence of the non- 
uniform (periodic) electric field, the different positions of the centre of the 
orbit in the unit cell of the lattice are not equivalent. This must cause some 
splitting of the Landau levels. 

Taking account of the electron spin causes each level to be split into two 
components; if the spin-orbit coupling is neglected, these components are 
separated (like those for a free electron) by a constant interval 28H, where £ is. 
the Bohr magneton: 


Eno(kz) = En(kz)+0PH, o=+I. (58.11) 


The same situation exists when the spin-orbit interaction is not neglected, 
if the crystal has a centre of inversion. In this case, the states of the electron in 
the absence of the field are degenerate with respect to the spin, and the magnetic 
field removes this degeneracy. The result is the same formula (58.11) with 8 
replaced by £&,(k,), where £,(k,) represents the change in the electron magnetic 
moment. 


§ 59. The electron effective mass tensor in the lattice 


Let us consider a point k = ko in k-space where the electron energy e (k) 
has an extremum; for example, the points corresponding to the top and bottom 
of a band. If there is no degeneracy at this point (apart from the possible Kra- 
mers degeneracy with regard to the spin; see the end of §55), the function 
€,(k) near it has a regular expansion in powers of the difference q = k—Ko. 
The first terms in this expansion are quadratic: 


es(k) = es(ko) +3 AM- ngige. (59.1) 


The tensor m, the inverse of the tensor of the coefficients m™1,, in (59.1), is 
called the electron effective mass tensor in the lattice. We shall show how this 
tensor may be expressed in terms of the matrix elements with respect to the 
Bloch functions p,,, at the point ko. 

If the spin-orbit interaction is neglected, the electron Hamiltonian has the 
form (56.1). We substitute in Schrédinger’s equation with this Hamiltonian 
the wave function in the form 


Woe = Ekt -r u = ET Ask (59.2) 
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The equation then becomes 


h2 h hg? 
T A +U) + BSk al ba = elk) bi (59.3) 
where fp = —ihy is the operator of the actual momentum. 

In the neighbourhood of the point k = ko, the vector q is a small quantity, 
and the expression in the square brackets in (59.3) may be regarded as the 
operator of a perturbation. In the zero-order approximation, when q = 0, 
the functions ¢,, are the same as the Yy, The usual perturbation theory therefore 
enables us to express the correction to the energy in terms of the matrix elements 
with respect to these functions. 

Since ko is an extremum point, there is no correction linear in q. This means 
that the diagonal matrix elements are 


(sk |p| skp) = 0. (59.4) 
To determine the correction quadratic in q, we must take into account the q? 


term in the perturbation operator in first-order perturbation theory, and the 
q term in the second order. This leads to formula (59.1) for e,(k), where 





1. Dik 1 œ (Didsy (Dees + (Pe)sv (Dies, 
me te ele) 8D 


the summation is over all s’ = s. To simplify the notation, the matrix ele- 
ments are written here and henceforward without the diagonal suffix ko: 
Psy = (sko|p|s’Ko). When there are close bands (i.e. small differences ¢,— e,), 
the second term in (59.5) may be large in comparison with the first, and the 
effective masses are then small compared with m. 

Now let a uniform magnetic field H be applied to the crystal. According to 
(56.7), the Hamiltonian acting on functions of the generalized quasi-momentum 
Q is obtained from (59.1) on replacing q by the operator 


@=Q-eAshe, A = 4Hxia/0Q. (59.6) 
The resulting Hamiltonian 
AY = es(ko) + 5AM- ik Gide (59.7) 


is, of course, valid only in the same energy range as the original formula 
(59.1). This means that, as well as the condition (56.3) for the field to be weak, 
it is assumed that the Landau levels considered are not too high. In this sense, 
the quantities q and Q are to be regarded as small; the increase of the potential 
A has the effect that, even in a weak field, we cannot suppose that A is small 
compared with Q. 

t The summation over k’ does not appear, since, according to (55.15), the momentum 


p = mv has only matrix elements diagonal in k, so that all the intermediate states pertain to 
the same quasi-momentum ky. 
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The next terms in the Hamiltonian after (59.7) contain the field H in the 
“pure” form (i.e. without acompanying operators 0/6Q). Such terms cannot be 
found simply from considerations of gauge invariance. Let us determine the 
first of these terms, which is linear in H. Because of the relative smallness of this 
correction, we can calculate it with Q = 0. 

Let us first investigate this problem without the spin-orbit interaction. The 
term of interest, linear in H, can only arise from the term linear in A in the 
original exact Hamiltonian (56.2) of the electron, i.e. by averaging with respect 
to the wave function p,,, the expression 


—(e/2mc)(p-A+A.p) = —(e/mc)A.p; (59.8) 


the equality depends on the gauge already chosen, with div A = 0. This adds 
to the Hamiltonian (59.7) the term 


H® = —M.H, (59.9) 


where 
M = (e/2mc) (sky |r Xp | sko) (59.10) 


is just the mean value of the magnetic moment of the electron in the state sko. 
We must emphasize that the correction (59.9) may be added to the Hamiltonian 
(59.7) without any fear that this effect has already been partly taken into account 
by the substitution (59.6): the terms linear in H in (59.7) do not occur when 
Q =0. 

We expand the expression (59.10) by the matrix multiplication rule, using 
the fact that, from (59.4), p has no diagonal matrix elements: 


M, = i YY KOs (Pars (Qe)se (Pye 


(and similarly for M, and M,); the correction to the Hamiltonian (59.7) is, 
as it should be, expressed in terms of matrix elements of the operator 2. Using 
the relation 

Rys = pys/i(ey — és), 


we can put M in the form 


_ _le r, (Dz)ss (Dy)s's— (Dy) sv (Ds) ss 
Mx = pre. see e(y (59.11) 


If the crystal has a centre of inversion, M is zero, and therefore so is the whole 
correction (59.9): under simultaneous time reversal and inversion, the state of 
the electron is unchanged (if its spin is ignored), and the right-hand side of 
(59.11) is therefore unchanged, whereas the magnetic moment must change sign 
under this transformation. 

We now take into account the spin-orbit interaction in the crystal by adding 
to the Hamiltonian (56.1) the spin-orbit term A, from (55.17). This changes 
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the term linear in q in equation (59.3): the operator p in this term is replaced by 
T = pt(h?2/4m?c?) aX VU. (59.12) 


The operator 7 has a simple physical significance: on directly commuting the 
Hamiltonian (including A,) with r, we find, in the absence of a magnetic 
field, 

i = ĝ/m. (59.13) 


Similarly, in the presence of a magnetic field, by making the usual substitu- 
tion p + Pp—eA/c in the original Hamiltonian (including 7,,), we find that the 
term linear in A has the form —em.A/mc, which differs from (59.8) only in 
that ĵ is replaced by 7. The magnetic moment (59.11) must be supplemented by 
the spin magnetic moment of the free electron, giving 


= B(SKo | 0x! Sky) +> — =, Caer Cy)ee — Codec ies (59.14) 


With the spin-orbit interaction included, the second term in this expression is 
not zero even if the crystal has a centre of inversion: simultaneous time reversal 
and inversion yield a state with the opposite spin direction, so that the whole 
expression (59.14), if it changes sign under this transformation, must reduce 
to the mean value of the operator fo $(k); cf. (56.12). 

Let us calculate the tensor &, for the case where the spin-orbit interaction 
may be regarded as a perturbation.' We can rewrite (55.17) as 


Hy = @.X, X = (if?/4m’c?) VUXV. (59.15) 


Regarding (59.9) and (59.15) as a perturbation, we find the correction to the 
energy in second-order perturbation theory, retaining only the cross-terms from 
(59.9) and (59.15). This correction (which still is an operator, a matrix with re- 
spect to the spin variables) has the form (56.12), with the tensor &, given by 


(Xi) ss’ (Lx) s+ (Li)ss (Xi)ss 
> y Miss Pads T Cada ides (59.16) 


bx = = Ô; ktr 
Ey — Es 


where AÊ = r xp. 

The above discussion has related to states that are not degenerate (except 
with respect to the spin). If there is degeneracy at k = kọ, the energy has to 
be determined by setting up the secular equation which takes account of the 
perturbation (the square brackets in (59.3)) as far as the second-order terms, 
i. e. according to QM, (39.4). The properties of the resulting secular equation 
depend on the symmetry at the point k,. We shall return to this topic in §68. 


t The expression (55.17) for H,, is the first term in an expansion in powers of the relativistic 
ratio (v/c)”, and is therefore, in a certain sense, always small. This smallness, however, is irrel- 
one . . . A . 
evant to the applicability of perturbation theory in any particular band. Hence H,,, in the 
problem under consideration, cannot always be regarded as a small perturbation. 
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PROBLEM 


Find the quasi-classical energy levels for a particle with the quadratic dispersion relation 
(59.1) in a magnetic field acting in any direction. 


SOLUTION. We reduce the tensor my, to diagonal form, and measure energy and momentum 
from an extremum point, say the minimum. Then 


=n (28,8), a) 
where mı, Mz, m3 are the principal values of the tensor my (and are positive quantities). Let 
n be a unit vector in the direction of the field H; then 

k, = n.k = niıkı+ nka + nks, (2) 
where 7, 712, ną are the direction cosines of the field relative to the principal axes of the tensor 


m,,. We have to find the area S of the part of the plane (2) that lies within the ellipsoid (1); it 
may be written as the integral 


S = f 6m.k—k,) dk, (3) 
taken over the volume of the ellipsoid (1).t By the change of variables Ak, = (2em)? q,, the 
integral is brought to the form 

S = (26)? A-%(mymym,)"” f dv .q— k) d?q, 


where the vector y in q-space has the components v; = (2em;)!?n;/h, and the integration is 
taken over the volume of the sphere q? = 1. The integration is easily carried out in cylindrical 
polar coordinates with the axis along v, and gives 


Sle, k,) = (270/h?) m i (e— fk? /2m), 


where 


II 


myn? + mon? + m,n}, 
(mym.m3/m, pi? . (4) 


my 


my 


Substituting in (58.7), we find the energy levels 


le; ŘH ( z) h?k? 


n+- J+ ; 
mic 2m; 


7 (5) 





En(Kz) = 


§ 60. Symmetry of electron states in a lattice in a magnetic field 


In this section we shall consider the exact general properties of translational 
symmetry of the wave functions of a Bloch electron in a magnetic field, which 
do not depend on any approximation (such as the condition for the field to 
be weak or the quasi-classicality condition). 

The application of a uniform magnetic field does not affect the physical 
translational symmetry of the system, which remains spatially periodic. A 


t Letf(x, y, z) = constant be a family of surfaces occupying a certain volume. The distance 
di between two of these surfaces, infinitely close to each other, is d/ = df) | Vf |, and the volume 
between them is dV=S(f )dl, where S(/) is the area of the surface having a given value of f. 
Multiplying the equation S(f)df =| Vf| dV by the delta function 6(/) and integrating over the 
volume and df, we find the area of the surface f(x, y, z) = Oas S(0)= f | vf 1d(/) ax. In our 
case | Yf | =- 1, and this gives equation (3). 
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distinctive feature is, however, that the electron Hamiltonian (56.2) does lose 
its symmetry, because it involves not the constant field strength H but the 
vector potential A(r), which depends on the coordinates and is not periodic. 

The non-invariance of the Hamiltonian naturally complicates the transfor- 
mation law for wave functions under translation. We take the vector potential 
of the uniform field in the gauge 


A = }Hxr, (60.1) 


and let y(r) be an eigenfunction of the Hamiltonian A(r). Under a translation 
r-r+a (where a is any of the basic lattice vectors), this function becomes 
y(r+a), and is then an eigenfunction of the Hamiltonian A(r+a), which is not 
the same as A(r), because the vector potential has changed: 


A(r) ~ A(r +a) = A(r)+4H xa. 


To find the required transformation law, we must go back to the original 
Hamiltonian by the gauge transformation 


A+A+VUf, f=—-4Hxa.r. 


The wave function is then transformed according to (56.4): y > y exp (ief/hc). 
Denoting the result of all these operations by 7,y(r), we thus find 


T(r) = p(r-+a) exp (dir.h Xa), (60.2) 


where h = |e| H/fic; T, is called the magnetic translation operator. If y(r) is 
a solution of Schrédinger’s equation A(r)y = ey, then (60.2) is a solution of 
that equation for the same energy e (R. E. Peierls 1933). 

From the definition (60.2), we easily find that 


T.Ty = Ta 40'0(8, a’), (a, a’) = exp(—4 ih.a xa’). (60.3) 


When a and a’ are interchanged, the exponent in the factor w(a, a’) changes 
sign, and the operators 7, and T, therefore do not in general commute: 


T.Ty = ToT: exp (—ih.a Xa’). (60.4) 


Thus the product of two operators Ê, and T,, generally differs by a phase 
factor from the operator T,,,. In mathematical terms, this means that the 
operators 7’, give not an ordinary but a projective representation of the trans- 
lation group; the basis of these representations is formed by the wave functions 
of the stationary states of a Bloch electron in a magnetic field.t The classifica- 


t Projective representations of groups have already been met with in Part 1,§134. The pro- 
jective representations of a group G are those given by operators G such that the relations 
between them are the same as those between the corresponding elements of G only to within 
phase factors: if GiG: = G3, we have for the operators Ĝ1Ĝ, = @12G3, where ws need only 
have unit modulus. 
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tion of the energy levels must consequently be based on the irreducible projec 
tive representations of the translation group, just as, in the absence of the field, 
they are based on the irreducible ordinary representations of this group. 

The translation group is Abelian (all its elements commute), and therefore 
all its irreducible ordinary representations are one-dimensional. The base 
function y of each such representation is simply multiplied by a phase factor 
under translation; for two successive translations, this factor must be equal to 
the product of the factors for each translation separately. This means that 


Tw = ekan, 


where k is a constant vector, the quasi-momentum of the electron, which is the 
parameter classifying the irreducible representations. 

A complete classification of the irreducible projective representations of the 
translation group can be made (E. Brown 1964, J. Zak 1964) if the magnetic 
field satisfies the condition 

h = 4xpa,/qv, (60.5) 


where p and q are any two relatively prime integers, and ag is one of the three 
arbitrary chosen basic vectors a1, a2, ag of the lattice; v = a; Xag.az is the 
volume of a unit cell of the lattice. Thus the magnetic field must be along one 
of the lattice vectors, and hv/4aag must be a rational number. Multiplying 
(60.5) by a; Xa, we can also write this condition as 


h.a, Xa, = 4zp/q. (60.6) 


To classify the irreducible projective representations of the translation group 
it is important to note that we can select from this group a sub-group (the 
magnetic sub-group) with respect to which the representation is not projective 
but ordinary. With the condition (60.6), such a sub-group is the set of transla- 
tions of the form 

am = Na + Noqa + NgAg (60.7) 


with integral coefficients nı, n2, ng. For, when the vector h is along as and 
satisfies the condition (60.6) for all translations of this form, the exponent in 
(60.3) is zero or a multiple of 2z, so that all the factors w(a, a’) = 1.1 The set of 
translations (60.7) forms a lattice with basic vectors a1, ga2, a3, which we call 
the magnetic lattice. The magnetic reciprocal lattice correspondingly has basic 
vectors by, b2/4, bs, where bı, be, bg are basic vectors of the original reciprocal 
lattice. 

The irreducible ordinary representations of the magnetic sub-group, like 
those of the whole translation group, are one-dimensional; they are charac- 


t The choice of the magnetic sub-group is in general not unique: instead of (60.7) we can 
take any set of translations of the form am = 14181 + N2422 + Ngaa, where g1, Qz are integers 
such that qg: = @. 
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terized by wave vectors (quasi-momenta) K, all non-equivalent values of which 
are in one cell of the magnetic reciprocal lattice. 

Let y® be the base function of one such representation with quasi-momen- 
tum k™ = K. For this function, 


Tos yD(r) = ek -am y(r). (60.8) 


In a translation through the basic vector a2 (which is not in the magnetic sub- 
group), we obtain from y“ a function y‘? with a different quasi-momentum. 
To determine the latter, we use (60.4) and (60.8) to write 


Pan yO = Pnn În, yO) 
= exp(—ih. am X a2) Ts, Ta, YO) 
= exp {— iam a) Xh tiam -kO}T,, pO(r) 


or finally 
Pan PUR) = em g(r), 


where 
k® = k®—a, Xh = K—2(p/q)b,; 


in the last equation we have substituted (60.5) and the reciprocal lattice vector 
bı = 27a Xag/v. We must now take separately the cases of odd and even gq.” 

Let q be odd. Repeating the translation through ag a further g—2 times, we 
obtain a total of q different functions with quasi-momenta 


k® = K, k® = K—2(p/q)b,, ..., k® =K—2 ay, (60.9) 


By subtracting an appropriate integral multiple of the vector bı, these values 
are converted (in some order) to the values 


k=K, K-+b,/g, K+2b,/q,...,K+(g—-1)b,/g. (60.10) 


These q functions form a g-dimensional irreducible projective representation 
of the translation group. We obtain all the non-equivalent representations when 
K takes values in a cell with sides bi/g, be/q, bs; the quasi-momenta k™, 
k™,... then take values in a cell with sides bi, be/q, bs. 

Now let q be even. Then, in the sequence (60.9), the (¢q+1)th value, equal 
to K—ph,, differs from K only by an integral multiple of the reciprocal lattice 
vector bı. Thus there are only +4 nonequivalent values of k, which are given 
by (60.10) with tq instead of q. In this case, therefore, the irreducible represen- 


t When q = 1, the magnetic sub-group is the same as the complete translation group. Thus, 
if h is an integral multiple of 47a3/v, the irreducible projective representations of the trans- 
lation group are the same as the ordinary irreducible representations, and the electron states 
are Classified in the same way as in the absence of the field. 
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tations are -:q-dimensional, and K takes values in a cell with edges 2b,/q, b2/q, 
bs. 

These results enable us to formulate the following conclusion about the man- 
ner of variation of the electron energy spectrum in the lattice when a magnetic 
field is applied to it that satisfies the condition (60.5). In the absence of the field, 
the spectrum consists of discrete energy bands, in each of which the energy 
é(k) is.a function of the quasi-momentum, the latter taking values in one recip- 
rocal lattice cell. When the field is applied, the band splits into g sub-bands, 
in each of which all the energy levels are q-fold or +q-fold degenerate, according 
as q is odd or even. The energy in the sub-band may be expressed as a function 
e(K) of the vector K, the latter taking values in 1/q? or 2/q? of the reciprocal 
lattice cell, according as q is odd or even. 

The picture given above is, in a certain sense, extremely sensitive to the mag- 
nitude and direction of the magnetic field. For, there are values of H arbitrarily 
close to a value satisfying (60.5) with some p and q, which satisfy the same 
condition with much larger q, so that by an infinitesimal change of the field 
the number of sub-bands can be increased indefinitely. It must be emphasized, 
however, that this does not imply a similar instability in observable physical 
properties. These are determined not by the specific band structure, but by 
the distribution of the number of states among small but finite energy ranges, 
and this distribution changes only slightly with the field, because what is 
greatly changed is not the energy of the states but only their classification as a 
result of the altered range of definition of the quasi-momentum. 


§ 6l. Electronic spectra of normal metals 


In actual crystals of normal (not superconducting) metals, the electrons form 
a quantum Fermi liquid of the type described in Chapter I. Certain differences 
arise, however, because this is not a “free” isotropic liquid but a liquid in the 
anisotropic periodic field of the lattice. 

Just as the energy spectrum of a free Fermi liquid has a similar structure to 
that of an ideal Fermi gas, so the spectrum of the electron Fermi liquid in a 
metal has a similar structure to the of an ideal “gas in the lattice”. The occur- 
rence of the quasi-momentum as a conserved quantity is due only to the spatial 
periodicity of the system (just as the conservation of the actual momentum 
is a consequence of the complete spatial homogeneity). It is therefore natural 
that the properties enumerated in §55 apply to the classification of levels in the 
spectrum of an electron liquid in a metal, the role of the particles (electrons) 
being taken by the quasi-particles. 

At absolute zero, the particles of an ideal Fermi gas in a periodic field occupy 
all the lowest levels, with energies £ up to a limiting value ep (which is equal to 
the chemical potential u at T = 0), determined by the condition that the num- 
ber of states with € = ep is equal to the total number of electrons. The energy 
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bands such that e,(k) < £p for all k are fully occupied; those with e,(k) > £p 
are empty, and those for which the equation 


es(k) = Er (61.1) 


has a solution are partly occupied. Equation (61.1) determines in k-space the 
limiting Fermi surface which (for each band) separates the filled and empty 
States. 

Similarly, in an actual metal there is a surface in k-space which separates 
the region of quasi-particle states that are filled (at T = 0) from the unoccu- 
pied states; on one side of this surface the quasi-particle energies € > £p, 
and on the other side € < ep. However, the concept of quasi-particles in a 
Fermi liquid has a real physical significance only near the Fermi surface, where 
the decay of the elementary excitations is relatively slight (see §1). Hence the 
idea of occupied energy bands which occurs in the description of the spectrum 
of an ideal Fermi gas has no literal meaning in an actual electron liquid. 

The quasi-particles near the Fermi surface are called conduction electrons. 
Their energy is in general a linear function of the quasi-momentum; similarly 
to (1.12), we have 

e(k)— ep ~ A(k—ky). vr, (61.2) 


where k, is a point on the Fermi surface and 
hyr ~ (0e/0k)k = kp (61.3) 


gives the conduction electron velocity at such a point. 

Near the Fermi surface there must also be a “transitional zone” in the con- 
duction electron distribution at non-zero temperatures. Hence we have as the 
condition for the Fermi-liquid theory to be valid T«fk,v,, where kp and vp 
are the characteristic values for the dimensions of the Fermi surface and the 
velocity on it. The dimensions k,are usually of the same order of magnitude 
as those of the reciprocal lattice cell, so that kp ~ 1/a; an exception occurs 
for the semi-metals (see below). Putting also as an estimate vp ~ fk,/m, we 
get the condition T < 104—105°K, which is always satisfied in practice. 

Almost all metals have crystal lattices with a centre of inversion. According 
to the discussion at the end of §55, all energy levels of the conduction electrons 
(with given k) are doubly degenerate with respect to the spin (the metals re- 
ferred to are neither ferromagnetic nor antiferromagnetic). 

The shape and configuration of the Fermi surface are important charac- 
teristics of any particular metal. In various metals they are quite different and 
in general complicated. The Fermi surface may consist of several separate 


t Formulae such as (2.11) for the effective mass, derived in §2 for the “free” Fermi liquid 
from considerations of Galilean invariance, of course do not apply to the electron liquid in a 
crystal lattice. 


§61 Electronic Spectra of Normal Metals 253 


sheets, which may be singly or multiply connected, open or closed; cf. the 
discussion of constant-energy surfaces in §55. 

The closed sheets of the Fermi surface can be divided into two classes, accord- 
ing as they form the boundary of regions of filled (at T = 0) or empty states 
of quasi-particles (in the former case € < &p within the cavity, in the latter 
case € > £p). Both cases can, however, be described in a similar manner if we 
suppose in the second case that the “empty” cavity is filled with “quasi-holes” ; 
the transition of the system to the excited state is then described as a transi- 
tion of quasi-holes from inside to outside the Fermi surface. The Fermi surface 
itself is then said to be hole-type, in contrast to electron-type in the first case.’ 
The physical difference between the two types of quasi-particles (electrons and 
holes) is clearly seen when they move in external fields. For example, all 
cross-sections of a hole-type (or electron-type) Fermi surface, which determine 
the quasi-classical trajectories for motion in a magnetic field, are of the hole 
(or electron) type as defined in §57. 

In an isotropic “free” Fermi liquid, as considered in §1, the Fermi surface 
was a sphere whose radius was determined by the density of the liquid in accord- 
ance with Landau’s theorem (1.1). An analogous relation holds for the elec- 
tron liquid in a metal, but the specific nature of the properties resulting from 
the periodicity of the lattice causes some changes in its formulation. 

The number of electrons in the metal is conveniently expressed per unit 
lattice cell. Let n be the total number of electrons in the atoms in one cell, and 
Tp; the total volume per cell of the reciprocal lattice on the occupied side of the 
Fermi surface (i.e. the side where € < ep). The word “total” here signifies 
that if the occupied regions corresponding to different sheets of the Fermi 
surface partly overlap, they must still be combined independently. The volume 
Tp will be measured in units of the reciprocal lattice cell volume itself, and the 
remark just made about overlapping regions means that Tp thus defined may 
exceed unity. 

The requisite proposition (Luttinger’s theorem), which replaces Landau’s 
theorem (§20) for a metal, is expressed by 


No = 2p = n— 2l (61.4) 


where / is an integer ( = 0). In the model of an ideal gas in a lattice, this num- 
ber has a simple significance: complete occupation of each band corresponds 
to two electrons in the reciprocal lattice cell (because of the two spin states), 
so that 2/ is the number of electrons occupying the / lowest bands, and n—2/ 
is the number of electrons in partly filled bands. Formula (61.4) expresses the 
far from trivial result that a similar situation continues to hold when the 


t We must emphasize, however, to avoid misunderstanding, that the term “hole” is not 
used here in the same sense as in the alternative method (see the end of §1) for describing the 
spectrum of a Fermi liquid; there, the holes were simply empty places formed in the filled 
region on excitation of the system. 
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interaction between electrons is taken into account.’ By the definition of a 
metal, the integer n, is not zero. 

Let there be only closed sheets (electron-type and hole-type) of the Fermi 
surface in a metal; let t and t“ be the contributions to Tp from the electron 
and hole cavities individually: 


tr= > t+ Sir; 
S S 


the summation is over all electron-type and hole-type sheets respectively. The 
quantity 7“? is the volume of the electron-type cavity, and 1—t® is that of 
the hole-type cavity. The numbers of electron and hole quasi-particles are 


n_=2>) 1%, n} =2¥VU-7®). 


With n even (and therefore n, even), cases can occur where n, is equal to twice 
the number of hole-type cavities. Equation (61.4) is then easily found to reduce 
to 

n = ñ}. (61.5) 
Such metals with equal numbers of quasi-particles and quasi-holes are said 
to be compensated. 

It should be pointed out that, when equation (61.5) is exactly satisfied, 7_ 
and n, themselves may be arbitrary, and in particular may be arbitrarily small. 
In such cases, when the volumes of all the cavities of the Fermi surface are very 
small (in comparison with the volume of one reciprocal lattice cell), the sub- 
stance is said to be a semi-metal.* There is, however, a lower limit to the num- 
ber of conduction electrons, beyond which an electron spectrum of the metal 
type becomes unstable and ‘cannot exist (see the end of §66). 

The thermodynamic quantities for a metal consist of lattice and electron 
parts. The temperature dependence of the latter is determined by the quasi- 
particles in the neighbourhood of the Fermi surface (the dispersion relation 
(61.2)). The nature of this dependence is of course the same as for an ideal 
Fermi gas or an isotropic Fermi liquid (cf. §1); the only difference in the formu- 
lae is due to the different number of quasi-particle states near the Fermi sur- 
face, which is no longer a sphere. 

Let v de be the number of states (per unit volume of the metal) in an energy 
range de. The volume element in k-space between infinitely close constant- 
energy surfaces corresponding to energies £p and e,+de is df de/hvp, where df 
is an element of area on the Fermi surface and vp the magnitude of the vector 
Vr = (1/4) 0e/0k normal to the surface. Hence 


z | af (61.6) 


PE = Gaye | Tor” 


tł For a rigorous derivation of this result, see J. M. Luttinger, Physical Review 119, 1153‘ 
1960. 
+ For example, in bismuth n_ = 11, ~ 1075, 
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where the integration is over all the sheets of the Fermi surface within one 
reciprocal lattice cell; for an open Fermi surface the faces of the cell itself are, 
of course, not part of the range of integration. 

The quantity (61.6) replaces in the thermodynamic quantities the expression 
which for a gas of free particles (with a spherical Fermi surface) was 


2  4npr _ mpr 
(22h)® pr/m 








— ah 


For example, the electron part of the thermodynamic potential 2 of a meta! is 


(cf. Part 1, §58) 
Q: = Que — 4 nwr VT?, (61.7) 


where £2), is the value of the potential at T = 0. Regarding the second term 
in (61.7) as a small correction to 2),, by the theorem of small increments, we 
can write down a similar formula for the thermodynamic potential ®: 


©, = Dye — tnw VT? (61.8) 


where vp and V are now assumed to be expressed in terms of P (in the “zero” 
approximation, i.e. at T = 0). 
Determining from (61.8) the entropy and hence the specific heat, we find 


Ce = 4 n%pVT. (61.9) 


The lattice part of the specific heat is proportional to T? (at temperatures small 
compared with the Debye temperature ©); hence, at sufficiently low tempera- 
tures, the electron contribution to the specific heat becomes predominant.* 

For the same reason, the electron contribution to the thermal expansion of 
the metal becomes predominant in this temperature range. Determining from 
(61.8) the volume V = 0®/0P and hence the thermal expansion coefficient x, 
we find 








_1/8V\ __ p a OVrr) 
a= Fù), = rT (61.10) 


Here, as also in the range T >> O (see Part 1, §67), the ratio 
aV/C = —Olog (Vvr)/OP 


s independent of the temperature. 


§ 62. Green’s function of electrons in a metal 


The discussion in §§56-58 related to the moticn cf one electron in a lattice to 
which an external magnetic field is applied. We shall now show that the results 
obtained remain essentially valid for quasi-particles (conduction electrons) 


t The smal] parameter of the expansion in (61.9) is the ratio Tjep; in the lattice specific 
heat it is 7/O; thus the two parts of the specific heat beccme comparable when T? ~ ©2 /€p. 
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in the electron liquid in an actual metal; there is only a certain change in the 
definitions of the quantities appearing in the relations (Yü. A. Bychkov and 
L. P. Gor’kov 1961, J. M. Luttinger 1961). The Green’s function formalism 
is suitable for a general treatment of the electron liquid. 

In Chapter II, this formalism has been developed for a “free” Fermi liquid. 
We shall now show how it must be modified for a liquid in a lattice. 

The Green’s function for an electron liquid (at T = 0) is defined in terms 
of the Heisenberg y operators of the electrons by the same formula (7.9), where 
the averaging is with respect to the ground state of the metal. Because of the 
homogeneity of time, this function depends on the arguments t, and tz only 
through the difference t = t;— tz. The spatial homogeneity, however, is now 
destroyed by the presence of the lattice field, which is external to the liquid. 
Hence the Green’s function does not depend only on the difference rı—r2. 
All we can say is that it is invariant under a simultaneous shift of r, and re 
by the same arbitrary basic lattice vector. In the following we shall consider 
the Green’s function in the a, r representation, i.e. use its Fourier component 
with respect to t, G,,(@; rı, r2). This function enables us, in principle, to deter- 
mine the energy spectrum of the electron liquid in the metal. We shall repeat 
(without going through all the calculations again) the arguments of §8 as they 
apply to the present case. 

It has been shown in §8 that the homogeneity of the system allows a complete 
determination of the coordinate dependence of the matrix elements of the y 
operators and thus the formulation of a general expression for the Green’s 
function in the space-time representation in the form (8.5), (8.6); from this 
we can go to the momentum representation in the form of the expansion (8.7). 

For the electron liquid in a lattice, the invariance of the matrix elements, 
expressed by equation (8.3), occurs only for translations through basic vectors 
of the lattice, ie. for r = a. This naturally leads to less definiteness in the de- 
pendence on the coordinates: instead of (8.4), we can only say that 


(0 | Y(t, r)| mk) = xh) exp [ —i@mo(k) t], (62.1) 
(mk | P(t, vr) 0) = ZSR (r) exp [iomo(k) t], 
where 


Xeme(t) = fiat (62.2) 


— ik. 
AGRA) = eT Vumek(E), 


k is the quasi-momentum of the state, m is the set of all other quantum numbers 
describing the state, and y and v are some functions of the coordinates, period- 
ic in the lattice; we have written the matrix elements only for transitions from 
the ground state, i.e. state 0. The properties of the functions y+? and y‘~ are 
similar to those of the Bloch wave functions of an electron in a periodic field. 
Expressing the Green’s function in terms of these matrix elements and then chang- 
ing to the Fourier components with respect to time (as in §8), we now obtain 
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instead of (8.7) the expansion 
(+) (+)* (-) (-)* 
: oa Yamk(Ey) Lamk (T2) , Xamk(T1) Xpmk (T2) 

Gap(; Frs Ta) = z! opu- eiO | w+p—e—i0 |” (62-2) 
with the same notation e‘+? and e‘— as previously, and the change k + —kin 
the second term. 

The presence of non-decaying one-particle elementary excitations near the 
Fermi surface of a metal has the result that, when € is near u, the energy of the 
state depends only on k. For such states the function G,,(w; rı, r2) has a pole 
at w = e(k)— u. Near the pole, it has the form 


Xok(r1) %ők(T2) 
Gloi 82) = ue Ona (62.4) 
When there is degeneracy with respect to spin, we must also sum over the two 
spin states. 

The determination of the energy spectrum from the Green’s function reduces, 
in principle, to an eigenvalue problem for a certain linear integro-differential 
operator. 

The basic ideas of the diagram technique in coordinate space remain the 
same in this case as for an ordinary Fermi liquid. In particular, with the self- 
energy function 2,,(t, ri, r2) as the sum of the set of diagrams determined in 
§14, we can write the Green’s function G,,(t, rı, r2) as the series (14.3), which is 
summed as the diagram equation (14.4). The thin continuous line in these 
diagrams represents the Green’s function Gt, rı—rz) of free electrons that 
interact neither with one another nor with the lattice. According to (9.6), 
this function satisfies the equation 

( a F s A u) GH(t, rı— r2) = 4up6(t) 6(1, — r2). 
Applying the operator (...) to the left of (14.4) and then changing to Fourier 
components with respect to time, we obtain the required equation 


(w + wt Ay/2m) Gap(O5 ri, r2) — f Zal; ri, 0’) Gyp(o; r’, r3) dex’ 
= 6,0(r; — r2). (62.5) 


Near the pole of G (with respect to the variable w) the right-hand side of 
the equation may be omitted, leaving a homogeneous integro-differential 
equation whose eigenvalues determine the energy spectrum of the system. The 
suffix £ and the variable rg are not affected by any operations here, i.e. they 
act as unimportant parameters in the equation. To determine the spectrum, 
we can therefore use the equation’ 


(w + u+ Ay/2m) %a(r)— | Zuo; r, vx Ar’) dx’ = (w— L) xr) = 0. (62.6) 


t For a microscopically homogeneous Fermi liquid, this equation in the momentum repre- 
sentation reduces to (14.13): 
w+ p= &(p)+ Zlo, p). 
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For an electron Fermi liquid in a metal, it replaces the ordinary Schrédinger’s 
equation. Its eigenvalues determine the spectrum, as already mentioned, with 
o = e(k)— u; the corresponding eigenfunctions are y(r) from (62.4), as is 
evident from a direct substitution of (62.4) in (62.5). Since the decay of exci- 
tations near the Fermi surface is only slight, the operator Ê is Hermitian for 
small œw (up to and including terms of order w). 

To go to the case where a weak external magnetic field is present, we must 
note that, in a gauge transformation of the vector potential, the y operators 
are transformed as wave functions (cf. (44.3), (44.4)), and therefore the Green’s 
function G,,(; rı, r2) is transformed as a product of y functions, w(r,)y (re). 
This means that the function y(r) in (62.6) must also be transformed as an 
ordinary y function. If the arguments of §56 are followed through, we easily 
find that they make use only of the periodicity of the crystal lattice, the general 
properties of the gauge transformation, and the fact that the energy spectrum 
is determined by the eigenvalues of a certain Hamiltonian; in the present case, 
the latter is the operator Ê in (62.6).* It is therefore clear that the result, that 
is the rule for changing from the spectrum in the absence of the field to that in 
the presence of the field, will be the same; the new spectrum is determined from 
the eigenvalues of the Hamiltonian 


e ia ge ee, 
(KFA), Poin, (62.7) 


where e(k) is the spectrum in the absence of the field. The significance of e(k) 
itself is now, of course, different from that in (56.7), since it takes account of 
the collective interaction of all the electrons in the system. 

Next, since the treatment of the quasi-classical case in §§57 and 58 was en- 
tirely based on the existence of a Hamiltonian of the form (62.7), the results 
obtained there are also directly applicable to an electron liquid. The question 
arises, however, of what is to be regarded as the field strength acting on a 
conduction electron (and thus what is to be taken as the vector potential A). 
Strictly speaking, it should be the exact microscopic value of the field at the 
point r due to all the electrons (and to the external field). In the quasi-classical 
case, however, the characteristic dimension ry of the region in which interaction 
occurs (the Larmor radius of the orbits) is large compared with the order of 
magnitude of the distances between electrons, i.e. the lattice constant a. This 
causes an automatic averaging of the microscopic field. The origin of the aver- 
aging may be explained as follows. 

Let us represent the microscopic field strength as the sum of its mean value 
(which is, according to customary terminology of macroscopic electrodynamics, 


t There may seem to be an important difference here in that the operator Ê in (62. 6) itself 
depends on w. In fact, this merely causes the Hamiltonian to be written in an implicit form. For 
small w (near the Fermi surface) we can change to an explicit form by expanding Ê ~ Lo +o, 
and then multiplying the equation Ĉe = w(1—Z£,)y on the left by the operator (1 — Ê,)~ 1. 
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the magnetic induction B) and a rapidly varying part H. The vector potential 
corresponding to the uniform field B increases over the whole extent of the 
orbit, taking characteristic values ~ Bry. The potential corresponding to the 
field H which oscillates over distances ~ a does not increase steadily, and only 
reaches values ~ Ba, which are negligible in comparison with Bry. But, as 
shown in§56, it is the potential of the field which determines the quantization of 
the motion of the electrons. Thus we conclude that it is sufficient to take account 
of just the potential A of the uniform induction B = curl A, which will play the 
part of the field acting on the electron (D. Shoenberg 1962). We shall see later 
(at the end of §63) that this situation may lead to some new phenomena in the 
magnetization of metals. 

Thus the quasi-classical quantization rule (58.7) for the electron liquid in a 


metal is written 
S(e, kz) = (2x |e|/hc) Bn +4), (62.8) 


where S(e, k,) is now the cross-sectional area of the actual constant-energy sur- 
faces of the conduction electrons in a metal (near its Fermi surface). 

As in the problem of one electron in a lattice having a centre of inversion’, 
the inclusion of the conduction electron spin causes a splitting of the levels in 
the magnetic field into two components: 


Eno(Kz) = En(kz) + 0B&(kz)B, o=H+I. (62.9) 


The quantity &(k,) results from the averaging of a function &(k) over the quasi- 
classical trajectory. We can with sufficient accuracy regard all trajectories as 
lying on the Fermi surface itself, so that the result of the averaging depends only 
on k,. We should emphasize that, for electrons in the Fermi liquid, the difference 
of E(k.) from the value unity which it has for free electrons is due not only to the 
spin-orbit interaction but also to the exchange interaction between electrons. 


§ 63. The de Haas—van Alphen effect 


The magnetic susceptibility of a metal in weak magnetic fields (6B «T, 
where £ is the Bohr magneton and B the magnetic induction) cannot be calcu- 
lated in general form. The reason is that in the theory of the Fermi liquid we can 
only deal with the paramagnetic (spin) part of the susceptibility: this part is de- 
termined by the conduction electrons near the Fermi surface, since the electron 
spins within the distribution compensate one another. The diamagnetic (orbi- 
tal) part of the susceptibility, however, contains contributions from all the elec- 
trons, including those within the distribution, where the concept of quasi- 
particles in the theory of the Fermi liquid has no meaning. The two parts of the 
conductivity are in general of the same order of magnitude, and only their sum 
has an actual physical significance. 


t Asin fact do the crystal lattices of all metals. 
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Let us now consider “strong” fields, for which 
T sB «p, (63.1) 


i.e. the intervals between the Landau levels are comparable with the temperature 
but still small in comparison with the chemical potential. In this case, the para- 
magnetic and diamagnetic parts of the magnetization cannot be separated at all, 
but the situation is different in that the magnetization of the metal has an oscilla- 
tory dependence on the field (the de Haas-van Alphen effect).* The monotonic 
part of the magnetization depends in this case also on all the electrons in the 
metal and cannot be calculated within the theory of the Fermi liquid. But the 
oscillatory part of the magnetization is, as we shall see, determined only by the 
conduction electrons near the Fermi surface and can be considered in a general 
form (I. M. Lifshitz and A. M. Kosevich 1965); we shall now discuss this part. 

The oscillatory dependence of the magnetization on the field is a consequence 
of the quantization of the energy levels of the orbital motion of the electrons. 
But the quantization affects only states corresponding to the motion of elec- 
trons in trajectories that are closed (in k-space). Hence the contribution to the 
oscillatory part of the thermodynamic quantities comes only from the conduction 
electrons on closed cross-sections of the constant-energy surfaces by planes per- 
pendicular to the given direction of the field. We shall assume that the quasi- 
classicality condition is satisfied on these cross-sections, i.e. that the numbers 
n determined by equation (62.8) are large: 


hcS/\e| B >> 1. (63.2) 


For typical Fermi surfaces in metals, the linear dimensions of the cross-sections 
are ~ 1/a,sothat S ~ a~?, and the condition (63.2) is certainly satisfied; cf. the 
first footnote to §57. 

The quasi-classical levels are given (with allowance for spin) by the expression 
(62.9), where «,(k,) are the solutions of equation (62.8); for each level there is 
a number of states given by formula (58.10). Hence the partition function which 
determines the thermodynamic potential Q (a function of u, T and the volume V 
of the system) is as shown in 


Q=-riy iP Y, log fı gap e) EL dle (63.3) 
The suffix s numbers the various sheets of the constant-energy surface; this 
suffix, and the sign of summation with respect to it, will be omitted for brevity. 
The integration with respect to k, is taken over an interval such as to include all 
different cross-sections (i.e. excluding periodic repetitions) of all sheets of the 
constant-energy surfaces. 


t Cf. Part 1, §60, where this effect has been discussed for an ideal electron gas. 
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Let us first separate from 2 the part Q that is an oscillatory function of the 
field, transforming the sum (63.3) by means of Poisson’s formula :* 


3 F(0)+ È Fo) š [ Fæde+2re $, f Fede. (63.4) 


The first term in this formula when applied to (63.3) gives the non-oscillatory 
contribution to 2; we omit this, and write 
O |e| BVT 


where [, is the oscillatory part of the integral 
lie = | dn f log | 1+exp | eriln dk, (63.6) 


and we have used the notation u, = u—oßġB. 
For the further analysis we define the function 
chS(e,k:) 1 


n(e, kz) = leB T7 (63.7) 


(cf. (62.8)), and change from integration over n in (63.6) to integration over e: 


ôn 
= e2riln 
lio = f [ee fı +exp 4 Ate Oe 


the choice of the lower limit of integration over £, which has been arbitrarily 
taken as zero, is immaterial, since in any case only the neighbourhood of 
€ = p18 important in the integral. 

Since the function n(e, k,) is large, the exponential factor in the integrand in 
(63.8) is a rapidly oscillating function of k,. These oscillations reduce to nothing 
the integral with respect to k,, and so the main contribution comes from the 
ranges of the variable k,in which n(e, k,) varies least rapidly and the oscillations 
are therefore slowest. Thatis, the main contribution to the integral comes from the 
regions near the extrema of n as a function of k, for any given e. Let k z, ex(é) be 
one such point; near it, we calculate the integral by the saddle-point method, 
putting in the exponent of the exponential 


dk, de; (63.8) 





me, kz) ~ nex(E) +> > (aa a) (kz — kz, ex)”, 


Nex(€) = n(e—k z, ex(£)), 


t See Part 1, §60. It is not important that the term F(0) in the sum in (63.4) has a coefficient 
4, since only the terms with large n are significant in the sum (63.3). 
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and in the non-exponential factors taking the value at k, = k, ex The result is 
that each of the extrema contributes to the integral the term 


Ho—E | dnex 1 | @n -1 ; 1. 
[ rog fi +exp- T | P TA exp 2rilnext 7 in de. 


zZ jex 








0 


The replacement of dn(e, k,)/Oe by dn,,/de is legitimate, since at the extremum 
@n/0k, = 0. The plus and minus signs in the exponent refer respectively to the 
cases where k, ex is a minimum and a maximum of n(e, k,).* We transform this 
expression by integration by parts, with 


dnex X J ‘ 
Je CP (2rilnex) de = prm dexp (27iInex(€)) 





and using the fact that the slowly varying function !0?n/Ok2'.. need not be 
differentiated. The integrated term does not give an oscillatory dependence on 


the field; omitting it, we have 





I etin/t exp (27iln.x) de : 
lo = Posen | [1 +exp ( „i )| | ô?n/ðk? ue (63. ) 
ò L 


where the summation is over all the extrema (whose significance is further dis- 
cussed below). 

The factor exp (2rtiln.„)in the numerator of the integrand is a rapidly oscillat- 
ing function of e. These oscillations reduce to nothing the integral with respect 
to « everywhere except in the region e— u, ~ T, where the denominator varies 
rapidly. The function n,,(e) itself varies smoothly in this region, and may there- 
fore be represented as 


Nex(E) © Nex(Mo) + Nex(Ha)(E — Ho); 


the factor | 62n/Ok? |=" is simply replaced by its value at e= u. Then, changing 
from integration with respect to € to one with respect to x = (e—p,)/T and 
replacing the lower limit — u„/T by — œ (since u/T > 1), we obtain* 
(es exp [27ilnex(Us) +4 i7] 
213/2 | O2n/Ok? R po 
t The saddle-point integral of the form fe dz is calculated by putting z = ue'™!$ or 
z = ue™ ‘it for a> 0 or a < 0, after which the integration with respect to u extends from 


— co [0 o0. 
+ We use the value of the integral 


sinh-? [272] Tnex(4o)]. 


This formula may be derived by considering the integral along a closed contour in the complex 
z-plane, consisting of the real axis, the straight line im z = 2x, and two “sides” at infinity; to 
ensure convergence on the latter, the real parameter « is replaced by a— i0. The integral along 
this contour is determined by the residue at the pole z = iz, whence we find J— e7] = 
= — 2mieT "t, 
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In summing this expression over a = +1, we can everywhere (except in the 
exponential factor) replace u, by u, since by the hypothesis (63.1) £B < u. In 
the phase (exponential) factor, however, this replacement is not allowable: 
since n,,(e) is large, even a relatively small change in its argument causes a con- 
siderable change of the phase. It is sufficient here, nevertheless, to expand 
n.,(u+ BB) in powers of £B, taking only the linear terms. The result is 


. iia 
ee Z sS exp [27ilnex(u) + zit] x 


BI2 | O2n/Ok? |2? y 
x sinh~! [222/Tn..()] cos [271BBE.xnex(4)], (63.10) 


where £x = (k, ex). It remains to elucidate the significance of the quantities 
appearing in this expression, and to substitute it in (63.5). 

According to the definition (63.7), the function n,,(e) is related to the extre- 
mum value S,,(e) of the cross-sectional area of the constant-energy surface 
S(e, k,) as a function of k,, and its value at € = u is the area of the extremal 
cross-section of the Fermi surface. As an illustration, Fig. 15 shows the extremal 





(two maxima and one minimum) cross-sections of a dumb-bell-shaped Fermi 
surface; they are perpendicular to the direction of the field, which is shown by 
the arrow. The summation over ex in (63.10) is over all extremal closed cross- 
sections of all sheets of the Fermi surface. To simplify the notation, we shall 
also use the cyclotron mass of the conduction electron in its motion along an 
extremal closed trajectory. According to the definition (57.6), this mass is 


h? f OS(e, kz) | A2 
mag -AE a E stew, 
27 | ðE Eua 27 


where S,,(e) = S(é, k, -x(€)); the second equation again follows because at the 
extremum point ôS(e, k,)/Ok, = 0. 

We thus have the final formula for the oscillatory part of the thermodynamic 
potential: 








= ASer 1 
Õ = ZÈ, (= D'Acos ( amBB +77) 


2V (mB)? | O°S(u, ke) |2 _2 
am [572 | Oke sinh 


A = In?Tm*/mßB, 





Q= cos zlm*£.x/m, (63.11) 


lex 
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where m is the actual mass of the electron, and the plus and minus signs in the 
argument of the cosine relate to minimum and maximum cross-sections respec- 
tively.‘ 

The magnetization M (the magnetic moment per unit volume) is calculated as 
the derivative* 


Meo. (63.12) 


In (63.11), only the most rapidly varying factors, the cosines, are to be differen- 
tiated. Because of the anisotropy of the Fermi surface (m* and S., depend on 
the direction of the field) the direction of M is not in general the same as that 
of B. For the oscillatory part of the longitudinal (along the field) magnetization 
we find 
_ { WS 1 
Z — 1+1 btas S 
M; = ¥(-1) Misin ( BB +3 a). 
OSCH, kz) |2 _2 
ðk? a Sinh, 





BHX mp)! Sex 
wlm 


The expressions (63.11) and (63.13) are complicated oscillatory functions of 
the magnetic field, and in general contain terms of various periodicities : terms 
originating from each of the extremal cross-sections of the Fermi surface have 
different periods with respect to the variable 1/B, namely 

1 4nmB_ _ 2 e| 


a = Sex, CHS ex 





cos zlm*čex/m. (63.13) 





Mı = 





; (63.14) 


These periods do not depend on the temperature. 

The temperature dependence of the amplitude of the oscillations is given by 
the factor 4/sinh 2. When 4 >> 1, the amplitudes decrease exponentially and the 
oscillations almost disappear. When å < 1, the factor 4/sinh å ~ 1, and the 
order of magnitude of the amplitudes is determined by the remaining factors in 
(2, and M;; all the following estimates refer to this case. 


t For a gas of free electrons, the Fermi surface is a sphere with radius kp = \/(2mw)/h, 
Sex = tk}, and formula (63.11) becomes that in Part 1, (60.5). 

+ The differentiation with respect to B needs explaining. Formula (63.12) may be derived 
as follows. The change in the Hamiltonian of the system due to an infinitesimal change in the 
vector potential of the field is 

6H =- [j.dAdV/c, 


where j is the current density operator; see QM, (115.1). The change in the thermodynamic 


potential 2 is found by averaging 6H for given values of „u, T and V. Since the quantization 
of the system is determined (as shown in §62) not by the exact microscopic field H but by its 


macroscopic mean value B, this means that in 8H also A is to be taken as the vector potential 
of the mean field B. The variation 5A can therefore be taken outside the averaging, after which 


6Q = (ôH) =— f (j). ôA d¥/c. 
Now with the magnetic moment defined by (j) = c curl M, integration by parts gives 
6Q =~ ôB. | Mav. 
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For a rough estimate we put m* ~ m, u ~ #k2/m,S ~ kẹ, where kp ~ 1/ais 
the linear dimension of the Fermi surface. Then 


2 ~ V(mpB)!2/4 ~ Vay Biu), M ~ nB(BB/u)¥?, — (63.15) 


where n ~ kê is the number density of electrons. The part M of the magnetiza- 
tion that varies monotonically with the field may be estimated by putting 


M ~ ZB ~ ĝ?mkpB/h? ~ ng.BBlu, (63.16) 


where y is the “monotonic” part of the magnetic susceptibility, estimated, for 
example, by means of the formula for the susceptibility of an electron gas in 
weak fields (see Part 1, §59). Accordingly, the monotonic part of the thermo- 
dynamic potential is O << VMB ~ Vnp(8B/u)*. A comparison of the above 
expressions shows that the oscillatory part of the thermodynamic potential is 
small in comparison with its monotonic magnetic part: 


ÖÖ ~ (BBI «1, 


and therefore in comparison with its value Qo ~ Vnu in the absence of the 
field: 0/20 < (8B/u)*”. The oscillatory part of the magnetization, on the other 
hand, is large in comparison with the monotonic part: 


MIM ~ (u BB)! > 1. 


Regarding the whole of the above theory of the oscillations of the magnetiza- 
tion, it should be noted that this applies to an electron liquid in an ideal crystal, 
and takes no account of a possible influence from processes of conduction elec- 
tron scattering by phonons and by lattice defects (e.g. impurity atoms). These 
processes cause an uncertainty in the electron energy, Je ~ h/t ~ ħvp/l, where 
t is the time between collisions, / is the mean free path, and vp is the electron 
velocity. The blurring of the sharp energy levels in turn smooths out the oscil- 
lations of the magnetization. The condition for it to be permissible to neglect 
scattering processes is that the uncertainty 4e is small in comparison with the 
intervals between the levels: 


hop > Avrll. (63.17) 


When T — 0 the values of B allowed by the condition (63.1) become arbitra- 
rily small, and only the condition (63.17) sets a limit. The magnetization M may 
in principle become comparable with the induction B itself (since M/B ~ 
~ ¥(u/BB)'"), but the magnetic susceptibility y = 0M/OH becomes large (in 
modulus) sooner :* since again only the oscillating factors need be differentiated, 
we have 


Iži ~ x(u/BB)P. (63.18) 


* To avoid unnecessary complications, the influence of anisotropy will be ignored in the 
following qualitative treatment. 
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In such a situation, the oscillations of the magnetization cause the curve of 
the macroscopic field H = B—4nM(B) as a function of the induction B to have 
a series of bends, as shown diagrammatically in Fig. 16 (A. B. Pippard 1963). 
The condition of thermodynamic stability requires that‘ 


(OH/OB)r, „n > 0. 





Fic. 16. 


Hence the states corresponding to parts of the curve such as bc cannot occur. 
The resulting situation is exactly similar to the one which causes a phase tran- 
sition in a substance when there is such a bend on the curve of pressure against 
volume (cf. Part 1, §§84 and 152). The equilibrium H(B) curve will in fact include 
a straight horizontal section ad, drawn so that the two shaded areas in Fig. 16 
are equal; the sections ab and cd correspond to metastable states. 

Let a metal sample have the form of a cylinder with its axis along the exter- 
nal field H. Then the field H within the cylinder is the sameas , and as the latter 
increases the body will undergo successive phase transitions with discontinuous 
changes of the induction: each time a point such as a is reached, the induction 
changes discontinuously from B, to B}? If, however, the sample is a flat 
plate in a magnetic field perpendicular to it, the body separates into alternating 
layers (diamagnetic domains) with different values of the induction, exactly like 
the separation of a superconductor in the intermediate state into normal and 
superconducting layers (J. H. Condon 1966). The external field is in this case 
equal to the magnetic induction averaged over all the layers. For example, in 
the range B, < H < B; the plate separates into layers with induction B, and B, 
and, as $ increases, the volume of the latter increases at the expense of the vol- 
ume of the former. 


§ 64. Electron-phonon interaction 


So far, we have considered the conduction electrons in a crystal without refer- 
ence to their interaction with the lattice vibrations, i.e. the phonons. This in- 
teraction represents the fact that the deformation of the lattice alters the field 


t Cf. ECM, §18, where a similar condition is derived for the electric case. 
+ We assume that the surface energy of the interface between the phases is positive. 
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in which the electron moves; this change in the field is called the deformation 
potential. 

The electron-phonon interaction plays a decisive role in transport phenomena 
in Semiconductors and metals, but here we shall be interested only in the quali- 
tative influence of this interaction on the energy spectrum of the electrons. 
To study it, we may usefully ignore the complications due to the anisotropy of 
the lattice and its microscopic inhomogeneity. Thus we regard the medium as 
a microscopically homogeneous isotropic liquid, and accordingly only longi- 
tudinal acoustic vibrations can occur in it. 

In the first approximation with respect to the deformation, the potential 
corresponding to this simplified model may be written 


Vest) = = f Wie- dx’, (64.1) 


where 0’ is the variable part of the density of the medium (and ọ is its constant 
equilibrium value). The function W(r—r’) decreases over distances of the order 
of the interatomic distances a. We shall simplify the expression (64.1) further 
by noting that, for interaction with phonons with wave numbers k< 1/a, 
these distances may be taken as zero, i.e. we assume that W = wé(r—r’) with w 
a constant. Then U,,, = wo’'(r)/o. In the quantum theory in the second-quanti- 
zation representation, this potential is written as the Hamiltonian of the elec- 
tron-phonon interaction, 


Bey = (wo) | PECE, r) 6'(t, 1) Ët, r)d?x, (64.2) 


where the operators Y and ¥* refer to the electrons, and ĝ' is the Heisenberg 
density operator describing the phonon field; for free phonons (not interacting 
with electrons), it is given by (24.10). 

In the mathematical formalism of Green’s functions, as applied to the elec- 
tron-phonon interaction, we have not only the electron Green’s function G but 
also the phonon Green’s function defined by 


D(X, Xa) = D(X, — Xa) = — KT 6'(X1) 6'(X2)), (64.3) 


the chronological product being expanded by the rule (31.2), which corresponds 
to the case of bosons. For free phonons, the Green’s function in the momentum 
representation is 
ok 1 1 
i a eres Trae 
ok? 


= Duki ie) 


see §31, Problem. (In these intermediate formulae, we put A = 1.) 
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If the electron-phonon interaction is regarded as a small perturbation, we 
can Set up a diagram technique based on the operator (64.2), as was done in 
§13 for a pair interaction of fermions. Without repeating all the arguments, 
we shall formulate the resulting rules for the construction of the diagrams (in 
the momentum representation). 

The basic elements of the diagrams are electron (continuous) lines and 
‘phonon (broken) lines, each having assigned to it a certain “4-momentum”. 
An electron line with 4-momentum P corresponds to a factor iGQ = 16,,G(P), 
the Green’s function of free electrons. A phonon line with 4-momentum K 
corresponds to a factor iD (K), the Green’s function of free phonons. Two 
continuous lines and one broken line meet at each vertex of the diagram; such 
a point has an additional factor —iw/o. 

For example, the first correction to the electron Green’s function is repre- 
sented by the diagram‘ 


K 
at oo) 
P-K 


with the corresponding analytical expression 
IÔG(P) = —(w?/o?)[GO(P)P | GO(P— K) D(K) d*K/(2x)*t. (64.6) 


The first correction to the phonon Green’s function is represented by the 


diagram 
r- O- a 


or, in analytical form, 
iôD(K) = 2(w®/o?)[D(K)}2 f G(P) GO(P—K) d*P/(2x)*; (64.8) 


the coefficient 2 comes from the contraction of the spin factors (6.064 = 2)» 
and we have also included the factor —1 due to the presence of one closed 
fermion loop (cf. §13). 

We shall show that the electron-phonon interaction in a metal leads to the 
occurrence of an “effective attraction” between electrons near the Fermi sur- 


t The structure of the expression (64.2) for the electron-phonon interaction operator is 
Similar to that of the electron—photon interaction operator in quantum electrodynamics. The 
rules of the diagram technique are therefore also analogous in the two cases. 

+ There is no diagram with a closed electron line, similar to (13.13a), because D(0) = 0. 
Here it is assumed that the limit k — 0 is taken before œ — 0. This corresponds to the fact 
that in coordinate space the integration over d°x (which in the present case signifies the passage 
to k — 0) is present in the definition of the Hamiltonian (64.2), and is therefore carried out 
before the integration with respect to time which arises when perturbation theory is applied 
to this Hamiltonian. 
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face. It can be intuitively described as resulting from the emission of a virtual 
phonon by one electron and its absorption by another (J. Bardeen 1950, 
H. Fröhlich 1950). 

Let us consider the diagram 


P(*Fitk A 


lx (64.9) 


a 


P3=Po-K Po 


representing the scattering of two electrons by the exchange of virtual phonons; 
the 4-momenta P = (e— u, p), K = (w, k), and u is the chemical potential of 
the electrons at T = 0, which is equal to the limiting energy ep. This diagram 
corresponds to the vertex function 


Lys, 2p = VS ay8s0, il = (- iw/o)? iDO(K), 


or 
w2k2 


Kon 0(w?— wk? + i0) ” 


(64.10) 


with fio = €,—&, Ak = pi— pı 

The momenta of electrons near the Fermi surface are in order of magnitude 
P ~ pp ~ Ala. Scattering of electrons through an angle ~1 corresponds to a 
phonon momentum #k ~ h/a and phonon energy fuk ~ hu/a ~ hw), where 
Wp is the Debye frequency; for metals, wp < ep. On the other hand, the 
electron cannot transfer an amount of energy greater than £— €p. Hence, if for 
both electrons | €— €p| <p, we certainly have 


I = wou? > 0. (64.11) 


Taking into account the significance of J’ as a scattering amplitude (§16), we see 
that its sign corresponds to attraction between particles. It must be emphasized 
that this result is valid only for electrons in a comparatively narrow shell (with 
energy thickness ~ wp) in momentum space near the Fermi surface. This fact 
has already been used in §43 to determine the value of the cut-off parameter in 
the theory of the superconductivity of metals". 


+ For a rough estimate of the constant w in metals, we may note that the change in the 
electron energy must be of the order of the energy itself (~ €p) when the density change 
o’ ~ o;hencew ~ Ep. 
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§ 65. Effect of the electron-phonon interaction on the electron 
spectrum in a metal 


Let us consider the influence of the electron-phonon interaction on the energy 
spectrum of electrons in a metal. 

It has been shown in §14 that for a Fermi-type spectrum the correction to the 
dispersion relation e(p) (in comparison with the spectrum of a system of free 


fermions) is 
de(p) = X(e— u, p)— = (0, p), (65.1) 


where X = [G]-1—G~? is the self-energy function. In the present case, the 
correction is due to the interaction with phonons, and the “unperturbed” 
spectrum is that which takes account of the “direct” interaction of the particles 
(electrons). According to (64.6)*, 


E(P) = — ôG- = 8G/[GO? = i(w%/o2) | G(P— K) DOK) d*K/(2z)', (65.2) 


but G® must now be taken as the Green’s function for electrons interacting 
with one another. Near its pole, such a function has the form 


GO(e—p, p) = Zle—n—W(p—pr)+i0.sgn(e—p)}-2; (65.3) 


see (10.2). The superscript (0) to vp denotes that this quantity does not yet 
include the effect of the electron-phonon interaction. 

Our object is now to obtain an estimate of the quantity (65.1), i.e. of the 
integral 


de = Sr [0% -u — w, p—k)— GO(— w, p—k)} Dw, k)d*K/(2z)*. (65.4) 


It will be seen from the subsequent calculations that the main contribution to 
this integral comes from the range in which the momentum p—k and the 
energy e—w (like p and e) lie within the Fermi surface, i.e. k <« pp, © «& u. 
For this reason we can take (65.3) as the functions G®. 

In spherical polar coordinates in k-space, with the polar axis parallel to p, 
we have dK = 2xk? dk dw dcos 0, where 0 is the angle between k and p. 
Instead of cos 6 we use the variable p, = 'p—k|; since p? = p?+k?—2pk cos 0, 


we have 
d*K = 20k? dk dw p, dp,/pk ~ 2k dk dw dp,, 


putting pı ~ p ~ pr. 
In the integrand in (65.4), only the factor in the braces depends on py; it is 
{...} = —(e— y) Zle — u — w— o(p, — pr) + i0.sgn (e — u — w)] -+ X 
x[-~ w- Pp —pr)— i0. sgn oj. 


t The results in this section are due to A. B. Migdal (1958). 
+ In the intermediate formulae, we put å = 1. 
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Because of the rapid convergence of the integral with respect to pı — pp, we Can 
extend the integration to + œ ; with the variable n = v©(p,1—p,), we obtain 
the integral 


[Ge 


EA | as ee ee ae 
vo [n —(€ —u—@)—i0.sgn(e—p—)][n +w +i0.sgn w] 


If the two poles of the integrand are on the same side of the real axis, the 
integral is zero (as may be shown by closing the contour of integration in the 
other half-plane). The integral is therefore non-zero only if e—~p >w >0 
or &— u < w < 0; it is —2niZ/v® in the first case and 2niZ/v®) in the second. 
Thus, using also the fact that D(w, k) is an even function of w, we find 

zw p e 1 1 
EALA | f aa. k2dw dk. (65.5) 
8x2our | w—uk+i0 w+uk—i0 
0 

The real and imaginary parts of this expression determine respectively the 
correction to the spectrum of quasi-particles (conduction electrons) and their 
decay. Let us first consider the latter. 

Separating from (65.5) the imaginary part by the rule (8.11), we find 


2 

inte sia2 = | eee: (65.6) 
Sxouv\) 

the integration with respect to k is taken from 0 to i e— u|/u, for which the 

pole w = uk of the integrand in (65.5) lies in the range from 0 to 'e— u'. Thus 

(in ordinary units) 

Zw? |e—u!3 

24nh out ` 


For a rough estimate of this quantity, we note that the parameters v® 
and w are of electronic origin and are expressed, in order of magnitude, in 
terms of only the interatomic distances a and the electron mass m: vf ~ 
~ prim ~ jma, w ~ ep ~ h?/ma?; see the last footnote to §64. The density 
e and the velocity of sound u depend also on the ion mass M: 9 c M,uocM SNE 
and so gut oc 1/M. Hence an estimate of the decay may be written 


—imde ~ |e—p/3/(hap)*, (65.8) 


— im ôe = (65.7) 


where the Debye frequency wp ~ uja oc M. 

Strictly speaking, the estimate (65.8) relates to values !e— u| <fwp, for 
which the integration in (65.6) is taken over the range k < le— u! /uñħ < wp/u, 
in which the phonon dispersion relation w = ku which we have used is in fact 
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valid. For a rough order-of-magnitude estimate, however, we can apply (65.8) 
even at the limit of the range where e— u ~ fiwp, obtaining 


—im de ~ wp ~ |e—p|. (65.9) 


Lastly, when e— u > fiw), the range of integration in (65.6) is independent 
of e— u, since the pole w = uk S wp is always between 0 and «— u. In this case, 
f k? dk ~ (wp/u}, and the decay is 


—imde ~ wp « £€— pu. (65.10) 


The expressions (65.8)-(65.10) give the specific decay due to the emission of 
phonons by electrons.' We see that, in the immediate neighbourhood of the 
Fermi surface, when |£— u| «<fiwp, according to (65.8), the decay is slight 
(jim (e— u)! « | e— ul), so that the concept of quasi-particles (conduction 
electrons) has an entirely definite meaning. In the range |e— u| ~ œp, how- 
ever, the decay of the quasi-particle becomes comparable with its energy, and 
the spectrum is blurred and loses much of its meaning. At still larger distances 
above the Fermi surface, when e—y >> fiw, (but of course still e— u <« yp), 
according to (65.10), the decay remains the same in absolute magnitude but 
again becomes small in comparison with the energy e— u, so that the quasi- 
particles recover a certain significance. Of course, as well as the phonon decay 
of the conduction electrons, there is always also the decay due to collisions 
between electrons. The latter decay, characteristic of any normal Fermi liquid 
(§1), is proportional to (e— u)? and in order of magnitude is ~(e— u}?/u, 1.e. 
is always small in the range where the theory is applicable. 

Let us now estimate the correction to the real part of e, i.e. to the spectrum 
itself. The real part of the integral over w in (65.5) is given by its principal 
value: 


jeu [e—a] 
o) s eee ee 
re | Dw, k) dw 2i E E dw 
0 0 


_ ok e—p—uk | 
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We therefore have for re de (in ordinary units) 


rede = —2™ _ [ kelogl TEŽ | ay, (65.11) 
8x2our | i e—uthuk | 


When £— u >> wp, the logarithm in the integrand is ~ fuk/(e— u), and the 
whole integral is estimated as fuk®,,./(e—u) ~ hu/a®(e—u). Noting also that the 


t The conservation of energy in the creation of a low-frequency phonon by a quasi-particle 
is expressed by (0¢/0k) .ok = v.k = udk; this can be true only if v > u. In a metal this con- 
dition is always satisfied, since vp >> u. 
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factor ọ in the denominator of (65.11) makes the whole expression oc 1/M, we 
arrive at the estimate 
re de ~ (hfwp)*/(e—) « £— u. 


Thus in this case the correction to the spectrum is relatively small, and so the 
spectrum is given by 


e—uxv(p—pr) for e—u> hop, (65.12) 


with the “unperturbed” value v® of the velocity on the Fermi surface. 

In the range e—u <« wp, the logarithm in (65.11) is ~(e—)/fuk, and the 
integral is estimated as (e— u) kŽax/ħu ~ (e—)/hua®. The whole expression 
(65.11) is thus proportional to e— u, with a coefficient independent of the ion 
mass M (since the product oz? is independent of M). This means that the spec- 
trum in that range is again of the same type: 


€— u ~ vr(p—pr) for e-u<hop, (65.13) 


but with the velocity vp, which differs from v® by an amount of the order of 
v® itself.t 

Thus the Fermi-type spectrum for electrons in a metal has two different 
values of the velocity, vp and v®, one in the immediate neighbourhood of the 
Fermi surface (¢— u << wp) and the other at e— u >> wp. The thermodynamic 
properties of the metal at low temperatures (T < wp) involve the parameter 
vy from (65.13). Such phenomena as the optical properties of the metal at 
frequencies w >> wp are determined by the velocity v@’. 


PROBLEM 


Determine the decay of long-wavelength (k « pp) phonons in a metal due to their 
absorption by electrons. 


SOLUTION. The correction to the Green’s function of the phonons is given, according to 
(64.8), by 
i8D-\(K) = —(2w*/o*) | GP) GP- K) d'P |27), 


P = (Pos p), K = (w, k). 


In the G functions, however, we must also include corrections due to the interaction of elec- 
trons with short-wavelength photons. According to the discussion in the text, these changes 
simply replace G by a function G that differs from (65.3) only in that the velocity v is 
changed to vp, and the renormalization constant Z to another one Z’. For small K, formula 
(17.10) may be used for the product G(P)G(P— K). The integration over dpodp reduces to 
the removal of the delta functions, after which there remains the integration over d cos 0 
(where 6 is the angle between p and k): 


1 
_ Zwippk cos 0 d cos 0 
270? w—Vpk cos 0 -+ i0 


6D-! (w, k) = 


t In these conditions, of course, the use of the first approximation of perturbation theory 
is not, strictly speaking, legitimate. The use of higher approximations, however, cannot alter 
the nature of the result: when the first-order correction becomes of the order of unity, the 
remaining corrections are also of that order. 
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(we take w > 0). The pole at cos 0 = w/kv, is within the range of integration (since vr > u), 
and the imaginary part of the integral is 


im 6D-! = Z’w*p2w/2no*vek. 
The phonon dispersion relation is found from the equation [D]-?+6D71 = 0, whence 
(in ordinary units) 
w = uk(i—ia), a = Z?wp2/4nh our}; 


the correction to the real part ofw is not of interest here. The product eu oc yM; hence, as a 
rough estimate, a ~ +/(m/M), i.e. the decay is always small. 


§ 66. The electron spectrum of solid insulators 


A characteristic feature of the electron energy spectrum of a non-magnetic 
insulating crystal is that even the first excited level is at a finite distance from the 
ground level, i.e. there is an energy gap (which in ordinary insulators is of the 
order of a few electron-volts) between the ground level and the spectrum of 
excited levels. 

An elementary excitation in an insulating crystal may be visualized as an 
excited state of an atom but one that cannot be assigned to any particular 
atom; the translational symmetry of the lattice leads, as always, to a “collecti- 
vization” of the excitation propagated in the crystal as if it were hopping from 
one atom to another. As in other cases, these excitations may be regarded as 
quasi-particles (which are here called excitons) with definite energies and quasi- 
momenta. Like all quasi-particles that can occur singly, the excitons have in- 
tegral angular momentum and obey Bose statistics." 

For a given quasi-momentum k, the exciton energy can take a discrete series 
of values ¢,(k). When the quasi-momentum takes values in one cell of the recip- 
rocal lattice, each of the functions ¢,(k) covers a certain band of exciton energy 
values; different bands may partly overlap. The minimum value of each func- 
tion (k) is non-zero. 

The insulator may contain, as well as excitons, electron excitations of another 
kind. These may be regarded as resulting from the ionization of individual 
atoms. Each such ionization causes the occurrence in the insulator of two 
independently propagated quasi-particles, a conduction electron and a “hole”. 
The latter is an absence of an electron in an atom, and so behaves as a posi- 
tively charged particle. Here again, in speaking of the motion of an electron 
and a hole, we are really referring to certain collective excited states of electrons 
in the insulator, accompanied (unlike exciton states) by a transfer of a negative 
or positive unit charge. 

The electrons and holes have half-integral spin and obey Fermi statistics. 
We must emphasize, however, that the electron-hole spectrum of the insulator 
is not at all like the Fermi-type electron spectrum in metals. The latter is char- 


t The concept of excitons was first introduced by Ya. I. Frenkel’ (1931). 
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acterized by the existence of a limiting Fermi surface in k-space, near which 
the electron quasi-momenta lie. In the present case, however, there is no such 
surface, and the electron and hole that appear simultaneously can have any 
quasi-momenta. 

The distinction between the two types of spectra can be more fully understood 
by considering the decay of elementary excitations. In a Fermi liquid, any quasi- 
particle outside the Fermi surface can create pairs of new excitations (a particle 
and a hole), and therefore has a finite lifetime which decreases rapidly away 
from the Fermi surface (an electron in a metal can also emit phonons; see 
§65). The decay of an individual electron (or hole) in an insulator, in an ideal 
lattice (at T = 0) is, however, precisely zero in a finite range of energies above 
its minimum value: the formation of an electron-hole pair always requires a 
finite expenditure of energy, because of the energy gap 4 (see below). The 
emission of an acoustic phonon by a quasi-particle is possible only if the latter’s 
velocity v is not less than the velocity of sound u; see the third footnote to §65. 

The possible values of the conduction electron energy (k) and hole energy 
e™(k) also occupy bands. The width of the energy gap in an insulator is usually 
taken as the sum 4 = ©) +e of the smallest possible values of the electron 
and hole energies. Since the electron and the hole appear or disappear together, 
it is this sum which has a real significance, not e} or ®}, separately; it is 
usual to take arbitrarily «, = 0. The minimum energy values can be reached 
for electrons and holes with the same or different values of the quasi-momen- 
tum k = ko; the gap is said to be respectively direct and indirect. If the energy 
levels in the band are not degenerate (or have only double degeneracy with 
respect to the spin because of the symmetry under time reversal), e(k) near its 
minimum has the form 

ek) = A+ 5m gign, E(k) = m1 Gig, (66.1) 
where q = k—ko, and m® and m® are the electron and hole effective mass 
tensors. 

In the literature, the electron band is often called just the conduction band, 
and the hole band is referred to as the valency band, which in the ground state 
of the crystal is completely filled with electrons. The formation of a pair of 
quasi-particles (electron and hole) is then regarded as the result of the passage 
of an electron from the valency band to the conduction band, leaving a hole 
at its original position. 

At large distances (compared with those between the atoms), the electron 
and the hole attract each other in accordance with Coulomb’s law. They can 
therefore form bound states. The electron and the hole together constitute an 
electrically neutral quasi-particle, i. e. an exciton. Fora given quasi-momentum, 
the bound states correspond to discrete energy levels of the electron—hole system, 


t At finite temperatures there is, of course, always a decay due to scattering by other quasi- 
particles. 
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with one exciton energy band for each level. The energies of the excitons 
thus lie below those of electron-hole excitations; the energy gap in the sense 
mentioned at the beginning of this section is consequently not the same as 4, 
but is less by an amount equal to the maximum binding energy of the exciton.* 

The exciton energy levels are easily calculated in the limiting case of weakly 
bound states, when the mean distances between the electron and the hole are 
large compared with the lattice constant a; this is called a Wannier-Mott 
exciton. In the opposite limiting case, when the electron-hole distance is of the 
order of atomic distances, we have a Frenkel’ exciton, which can of course be 
regarded only formally as a bound state of electron and hole. 

Let us consider an insulator crystal having cubic symmetry. For a Wannier- 
Mott exciton, we can suppose that the electron and the hole attract each other 
according to Coulomb’s law, the remaining atoms in the lattice serving only to 
create a uniform dielectric background which weakens the interaction by a 
factor e, the permittivity of the crystal (taken for the frequencies that corre- 
spond in order of magnitude to the exciton binding energy); thus the electron- 
hole interaction energy is written as U = —e*/er. Let the gap in the spectrum 
be direct, and let us assume for simplicity that the electron and hole energy 
minima are at k = 0. In a cubic crystal, the effective mass tensors reduce to 
scalar constants m, and m,, so that 


ek) = A +AKY2me, E(k) = #k?/2my,. (66.2) 


It has been mentioned at the end of §56 that the motion of the particle in a 
crystal lattice subjected to an external electric field varying slowly in space is de- 
scribed by Schrédinger’s equation with a Hamiltonian in which e(k) plays the 
part of the kinetic energy. Since in the present case the functions e©(k)— A and 
e(k) are the same in form as the kinetic energies of ordinary free particles, 
Schrédinger’s equation for the system in question is the same in form as that 
for a system of two ordinary particles with Coulomb interaction, i.e. Schré- 
dinger’s equation for the hydrogen atom. We can therefore write down imme- 
diately the energy levels of the system, i.e. the exciton energy, as 


hrk? met 


ex ss = ee ee a 
A aeS (me +m) 22h?n? 


(66.3) 
(G. H. Wannier 1937). The first term in this expression is the energy of the 
exciton moving “as a whole” with quasi-momentum k. The second term gives 
the binding energy of the electron and hole in the exciton; m = myn,/(m,+m,) 
is the reduced mass of the system. For a given k, the discrete energy levels of 
the system become closer as the energy increases to the limit of the continuous 
spectrum. The condition for (66.3) to be valid is that the orbit radius should be 


t Exciton states, however, have a finite lifetime, since the electron and hole can recombine 
with the emission of, for example, a phonon or a photon. 
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sufficiently large, r, ~ f®en®/me® >> a. This is certainly true for large n, but 
in crystals with large it may also be satisfied for n ~ 1." 

To conclude this section, let us return to the assertion in §61 that there is a 
lower limit to the number density of conduction electrons in a semi-metal. 

In an insulator, where there are no electrons and holes at T = 0, the possibil- 
ity that they can form bound states means only the occurrence of new branches 
of the energy spectrum. In a compensated metal, this possibility would 
mean that a state with free electrons and holes is not the lowest state, i.e. a 
spectrum of the metallic type would be unstable. The possibility of forming 
bound states is eliminated by the screening of the Coulomb interaction be- 
tween the electron and the hole by other quasi-particles lying “between them”. 
That is, the mean distance between the quasi-particles must be of the order of 
or less than the exciton size r,, (in its ground state). The lower limit of electron 
and hole number densities in a metal, set by this condition, decreases with their 
effective mass. 


§ 67. Electrons and holes in semiconductors 


The energy spectrum of pure (or intrinsic) crystalline semiconductors differs 
from that of insulators only quantitatively: the gap 4 is smaller, and there is 
therefore a considerable carrier density (in comparison with an insulator) in a 
semiconductor at ordinary temperatures. The distinction is obviously arbitrary, 
and also depends on the temperature range concerned.’ In impurity (or doped) 
Semiconductors, the impurity atoms are a further source of electrons or holes, 
the energy gap of these atoms for electron transfer to the lattice (donor impu- 
rity) or from the lattice (acceptor impurity) being less than the energy gap in 
the original spectrum. 

Let us consider further the relation between the gap 4 and the density of con- 
duction electrons and holes in a semiconductor (or insulator). 

The formation or disappearance of an electron (e) and hole (h) pair may be 
thermodynamically regarded as a “chemical reaction” e+h = 0, the ground 
state of the crystal acting as a “vacuum”. From the general rules (see Part 1, 
§101), the condition of thermodynamic equilibrium in this reaction is 


Me+ uin = 0, (67.1) 


where u, and u, are the chemical potentials of the electrons and holes. Because 
of the comparatively low densities of electrons (n,) and holes (n,) in a semicon- 


t It is noteworthy that, near the upper edge (maximum) of the band, where the effective 
masses are negative, bound states of two electrons (or two holes) may be formed. The energy 
of such states lies in the forbidden region above the maximum total energy of the electrons. 

+ The values of the energy gap 4 for several semiconductors are: silicon 1.17 eV, germa- 
nium 0.74 eV, indium antimonide 0.24 eV, gallium arsenide 1.52 eV, lead sulphide 0.29 eV. 
For diamond, a typical insulator, 4 = 5.4eV. 
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ductor (at T « 4), the Fermi distribution for them is almost exactly the Boltz- 
mann distribution, so that the electrons and holes form a classical gas.t Then 
the law of mass action follows in the usual way (see Part 1, §101) from (67.1); 
according to this law, the product of the equilibrium densities is 


nen, = K(T), (67.2) 


where the right-hand side is a function of temperature depending only on the 
properties of the host lattice, at whose atoms the electrons and holes are creat- 
ed and annihilated; this function is independent of the presence or absence 
of impurities. Let us calculate the function K(T) for the particular case where 
the electron and hole energies are quadratic functions of the quasi-momentum 
(66.1). 

The quasi-momentum distribution of the electrons (per unit volume) is 
given by the Boltzmann formula 


He— E(k) dèk 
exp | | Ga 


the factor 2 taking account of the two directions of the spin. The energy distri- 
bution is given by the substitution 


dk «2m 
Que hs 





/(€e—4) dee, 


where m, = (mymzm3)“3 and mı, me, m; are the principal values of the effective 
mass tensor m$). The total number of electrons per unit volume is thus 


Ne = ore | (Ee — A) e~*!T dee; 


the integration can be extended to infinity, because of the rapid convergence. 
Calculation of the integral gives 








3/2 
te = 2 ( a) etu —AIT, (67.3) 
Similarly, 
3/2 
m = 2 ( ra en, (67.4) 


Finally, multiplying the two expressions and using (67.1), we obtain the re- 
quired result: 
(m.m,)?!2 


Seg Teer, (67.5) 


Neh = 


t The electron and hole densities in semiconductors at ordinary temperatures are 1012- 
10'? cm~*, whereas in metals they are 1077-1023 cem~3. 
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In an intrinsic semiconductor, where all the electrons and holes are formed in 
pairs, 


3/4 
Ne = Nh = gr Teer. (67.6) 


Equating (67.6) and (67.3), we find the chemical potential of the electrons:' 


Me = 4A+23T log (m/m). (67.7) 


The contribution of the electrons and holes to the thermodynamic quantities 
in a Semiconductor is exponentially small when T « 4. Since the creation of 
one electron-hole pair requires an energy close to 4, we have as the electron- 
hole contribution to the internal energy Ey, ~ Vn,4 with n, from (67.6). This 
may usually be neglected in comparison with the lattice contribution to the 
crystal energy. 


§ 68. The electron spectrum near the degeneracy point 


In this section, we shall use simple examples to show how symmetry argu- 
ments may give the form of the energy spectrum of electrons or holes in a semi- 
conductor or insulator near certain points in k-space (the reciprocal lattice) 
that are distinguished by their symmetry.’ 

Let us consider a lattice of the cubic crystal class O,, and the properties 
of the energy spectrum near the point k = 0, a vertex of a cubic cell in the 
reciprocal lattice; this point has the symmetry intrinsic to the complete point 
group O,. 

AS a first example, let us take the spectrum without allowance for the electron 
spin, and let the energy level in the band, belonging to the irreducible represen- 
tation E, of the group O,, be doubly degenerate at the point k = 0 itself. 
Away from the point k = 0, the degeneracy is removed, and the problem is to 
find all branches of the dispersion relation e(k) near this point. 

It has been shown in §59 how the departure from some point k = ko in 
k-space may be regarded as a perturbation. The specific form of the perturbation 
operator is, in the present case, unimportant. It is sufficient to know just the 
structure of the expressions that give the correction to the energy in each order 
relative to the small quantity q = k—ko; in our case ko = 0, and so q = k. To 
the first order, the corrections are determined by the secular equation formed 
from the matrix elements (for transitions between states belonging to the same 
degenerate level) of an operator having the form k.Ẹ, where ¥ is some vector 


t In the literature this is often called the Fermi level, but the chemical potential of the elec- 
trons in a semiconductor does not have the significance of a limiting energy as in metals. 

+ When the electron spin is neglected, this problem is formally identical with that of the 
energy spectrum of phonons in a crystal; see Part 1, 8136. 

$ For the notation for representations of point groups see OM, §§95 and 99, 
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operator. In the present case, because the symmetry group contains a centre 
of inversion, all the matrix elements of Ẹ are certainly zero, so that there is no 
effect of the first order in k (cf. Part 1,§136). To the second order in k, the cor- 
rections to the energy are determined by the secular equation formed from the 
matrix elements of an operator having the form 


VP = Dikike, (68.1) 


where y,, is an Hermitian tensor operator (symmetrical in the suffixes i and k). 
These include the corrections from the terms linear in k in the Hamiltonian in 
second-order perturbation theory and the corrections from the terms quadratic 
in k in first-order perturbation theory. The matrix elements of the operator 
(68.1) certainly include some that are not zero, but relations exist between them 
because of the requirements of symmetry. 

As regards their transformation law under the symmetry operations, the 
wave functions which form the basis of the representation E, can be taken in 
the form 

pi ~ P+oy?+022z?, pe ~ +02 +2%, 
where 
w = eiB, o? =w, 1+0+o7 = 0, 


and ~ denotes “is transformed as”. The rotation C3 about a diagonal of the 
cube transforms the coordinates by x, y, z — z, x, y; the functions y; and ye 
are transformed as shown by 


C3: Yi > WY1, Ya > wpe. 


The rotation C% about an edge of the cube (x, y, z > x, —z, y) transforms the 
functions as follows: 


C: Yi > pa Ye yp; 
and so on. Under inversion, the coordinates x, y, z change sign; yi and %2 
are unchanged. 
From this, we easily conclude that all the matrix elements of the non-diago- 


nal components y,, are zero, and the matrix elements of the diagonal compo- 
nents reduce to two independent real constants: 


(1 | yex| 1) = (21 Yax] 2) = CL] yy] 1) =... = 4A, 
(1 | ¥xx[ 2) = (2 | Yxxl 1) = B, 
(1 | yyy 12) = œB, (1 | yz2| 2) = œB. 


The matrix elements of the operator (68.1) are now 


(IV 11) = 21712) = 4k, 
(11V|2) = 2/V| 1)" = Be +ak2 +o). 
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Forming and solving the secular equation from these matrix elements, we ob- 
tain two branches of the spectrum: 


€1,2(k) — e(0) = Ak? + B[ki— 3(K2K2-+ k2k2 + k?kZJ!!2. (68.2) 


The degeneracy is removed on departure from the point k = 0 in any direction 
except the diagonal of the cube (k, = k, = k,)." 

As a second example, let us consider the spectrum taking account of elec- 
tron spin; the energy levels then correspond to two-valued (spinor) represen- 
tations of the symmetry group. Let the level be fourfold degenerate at the point 
k = 0, corresponding to the irreducible representation D, (or D;) of the group 
O,.* 

The base functions of this representation may be chosen so as to transform 
like the eigenfunctions yp}, (m = —j, ...,j) of the angular momentum j = 3/2.5 
This enables us to use the following procedure, which considerably simplifies 
the solution of the problem (J. M. Luttinger 1956). 

For a four-dimensional representation, the matrix of the operator (68.1) is 
4x4, with 16 elements. Any such matrix can be represented as a linear combi- 
nation of 16 given linearly independent 4 x4 matrices, which we take as the 15 
matrices comprising Jy, Jy» ies Jy) te Ue J 2—7] and those obtained from these 
five by cyclic permutations of the suffixes x, y, z, together with [/,, [/,,J]4].3 
[...], denotes the anticommutator. Here f, J,» Jz are the matrices of the Car- 
tesian components of the angular momentum j = 3/2, taken with respect to 
the four functions y®”. On the other hand, with that choice of the base functions 
we must assume that the operators f, J, J, are transformed under rotations 
and reflections as the components of an axial vector. This enables us to write 
the operator V, quadratic in k,, k, and k,, as a combination of expressions in- 
variant under all transformations in the group O,: 


V = Bik? + 4Bo(k2f2+ k2j2+ 22) 
+Ba(Kakyl fas Syla + kyky, Sela +kxkel fe Jxl+), (68.3) 


where f:, P2, P3 are real constants. 
The matrix elements of the operator (68.3) with respect to the functions 


pi ~ Yam Yor vi, pa~ pao, ypa ~ yr 20 


t The same result (68.2) is obtained for the representation E, (at the point k = 0). The dis- 
persion relation near a given point is indeed always the same for representations differing only 
bya multiplication by any one-dimensional representation of the group (here, E, = E,X Azu). 
It is evident that in such cases the matrix elements for transitions between different base func- 
tion are in one-to-one relation. 

t Such a situation occurs for the bottom of the hole band in diamond, silicon and germa- 
nium, which all have the same type of lattice. 

$ In QM,§99, Problem, it is shown that the irreducible representation D*!2) of the complete 
rotation group remains irreducible with respect to the group O, and is the same as its represen- 
tation D’. 
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are now easily calculated from the well-known matrix elements of the angular 
momentum, given in QM, (29.7)-(29.10). The calculation gives 


Vir = Vag = (81+ 382) (KE +43) + (814 982) k?, | 
Voo = Vag = (B1+7B2) (kå + k5) + (81 +82) ki, 


Vin =—Vy = 4V 3psk:(ky + tks), (68.4) 
Vis = Vu = 2/3 8(K3—k2) + iv 3Pgikxky, 
Vig = Veg = 0. J 


The construction of the secular equation can be simplified by noting that the 
splitting of the level certainly cannot be complete; there must remain a twofold 
(Kramers) degeneracy. This means that each root 4 = «(k)— (0) of the secular 
equation (each eigenvalue of the matrix P) is double. Thus to each eigenvalue / 
there will correspond two linearly independent sets of quantities ¢, (n = 1, 2, 
3, 4), solutions of the equations 


>> Vin Pm > An (68.5) 


By combining these two sets, we can therefore impose one further condition on 
the ¢,, in particular make one of them vanish, say œ, = 0. Then equation 
(68.5) with n = 4 gives 

Vidi t+ Vab +V ahs = 0. 


Substituting $; from this in the equations with n = 1 and 2, we obtain two 
homogeneous equations in two unknowns @; and ġ32: 


Garras Ya-varaive) (a) = 0) 


the equation with n = 3 gives nothing new. Thus the problem of the eigenvalues 
of the 4x4 matrix is reduced to the problem for a 2x2 matrix. Forming and 
solving its secular equation, with the values of V „„ from (68.4), we find 


2 = HV i+ Voo)t [4(V11—-Vo9)?+ | Vie [2+] Vig TA 
or finally 
€1,2(k)—e(0) = Ak? [Bk4+C(k2k2 + k2k2 + k2k2) 22, (68.6) 


where 
A = fit5po,, B= 168, C= 3(4682— 1682) 


(G. Dresselhaus, A. F. Kip and C. Kittel 1955). The level is split on departing 
from the point k = Oin any direction. 


t The application of perturbation theory to the states of only one degenerate level presup- 
poses that the intervals e(k)— (0) in the resulting splitting are small in comparison with the 
distances of the adjacent bands, including those which are split off by the spin-orbit inter- 
action. 
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Let us briefly consider the form of the equations describing the behaviour of 
particles near the degenerate bottom of a band in a magnetic field. We shall take 
a particular case, the second of those considered in this section, i.e. the spectrum 
(68.6). 

The direct application of the Hamiltonian formed from (68.6) by the general 
rule (56.7) would encounter difficulties because the spectrum is not analytic 
near k = 0. These difficulties can be avoided by making the change k — £ = 
K—eA /fic not in (68.6) but in the matrix Hamiltonian (68.3) (with symmetriza- 
tion with respect to the components of k, in order to preserve the Hermitian 
property). Each matrix element of the Hamiltonian is then converted into a lin- 
ear differential operator which acts not only on the spin indices but also on 
the arguments of the functions ¢,(K) in the equations (68.5); these are thereby 
transformed into a set of four linear differential equations. 

To take account of spin effects in the presence of a magnetic field, we must 
add to the Hamiltonian (68.3) terms which depend directly on H and are not 
determined by considerations of gauge invariance. Since the field is assumed 
weak, the terms added must be linear in H; also, because k is assumed small, 
they must be independent of k (cf. §59). In the present case, the general form of 
such terms, invariant under all symmetry transformations of the crystal, is 


BH .5+83(H.f8+ H, f+ Hf). (68.7) 


To conclude this section, we may mention the interesting situation that arises 
if one of the bands meeting at the degeneracy point ko is the conduction band 
and the other the valency band. The energy gap in such a spectrum is zero; an 
infinitesimal amount of energy is enough to create an electron and a hole with 
momenta close to ko. Such crystals are in a certain sense intermediate between 
insulators and metals. There is no energy gap, but the electron and hole states are 
separated except at one point in k-space. We may say that it is a metal in which 
the Fermi surface is shrunk to one point ko. At T = 0, such a gapless semicon- 
ductor’ has no carriers, but at low temperatures their number increases by a 
power law (not exponentially). The form of the spectrum near the point ko can- 
not be established from symmetry arguments alone; the Coulomb interaction 
of the electrons and holes causes a singularity in the matrix elements of the per- 
turbation at that point. 


t One example is a modification of tin called grey tin. 
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§ 69. Equation of motion of the magnetic moment in a ferromagnet 


THE magnetic structure of crystals generates specific branches of the energy 
spectrum. In proceeding to investigate such spectra, let us first recall some fea- 
tures of interactions in magnetic bodies. 

The chief form of interaction in ferromagnetic substances is the exchange in- 
teraction of the atoms, which establishes the spontaneous magnetization. A char- 
acteristic property of this interaction is that it is independent of the orienta- 
tion of the magnetization relative to the lattice: the exchange interaction results 
from the electrostatic interaction of the electrons and the symmetry of the wave 
function of the system, and does not depend on the direction of the total spin." 

The simplest ferromagnetic system is an insulator whose crystal lattice con- 
tains atoms having a magnetic moment, the sign of the exchange interaction 
being “ferromagnetic”, i.e. such that the parallel position of the moments is 
energetically favourable. The ground state of the system is then the one in 
which all the spins are parallel. More precisely, in this state the projection of the 
total spin of the system on some direction has its greatest possible value Z's, 
(summed over all atoms), where s, is the spin of one atom. For the exchange in- 
teraction Hamiltonian H,,.,, commutes with the operator S of the total spin of the 
system, and therefore with its component $, as follows from the facts that Asen 
is independent of the direction of the spins, and Ŝ is the rotation operator in 
spin space. Hence the ground state must have a definite value of S,, and the 
maximum S, corresponds to the minimum energy. Then s, and the component 
S, of the spin of each atom have their maximum values, so that the mag- 
netic moment in the ground state is equal to its “nominal” value Xu, where 
Ha is the magnetic moment of one atom. This property however, is destroyed 
by weaker (relativistic) interactions. 

In more complex cases, the magnetization of the body is not equal to its 
nominal value. In particular when the interaction is not ferromagnetic between 
all atoms, structures can be formed of two oppositely magnetized sub-lattices 
whose magnetizations are different and which therefore are not completely 
compensated; substances with such a structure are called ferrites, while the case 
of complete compensation corresponds to the antiferromagnets. 

t Experimental results on the gyromagnetic ratios g, which are very close to 2 for ferro- 
magnets, indicate that ferromagnetism is of spin origin. 
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Lastly, in a ferromagnetic metal we cannot treat the spins of the atoms inde- 
pendently of the conduction electrons, which are never (because of Fermi 
degeneracy effects) completely magnetized even at T = 0. 

As in any macroscopic system, weakly excited states of a ferromagnet may 
be regarded as an assembly of elementary excitations, a quasi-particle gas. The 
elementary excitations in an ordered distribution of atomic magnetic moments 
are called magnons. Since we are dealing with quasi-particles in a crystal lattice 
with translational symmetry, the magnons have definite quasi-momenta (tak- 
ing values in one cell of the reciprocal lattice), not actual momenta. In the clas- 
sical picture, the magnons become spin waves, i.e. oscillations of the magnetic 
moments, propagated through the lattice. The magnons obey Bose statistics, 
and large occupation numbers of the magnon states correspond to the classical 
limiting case of spin waves. 

If the spin wavelength is large compared with the lattice constant a, i.e. the 
wave number k « 1/a, the spin wave may be treated macroscopically; the dis- 
persion relation w(k) of the waves is then expressed in terms of phenomeno- 
logical parameters (material constants) that occur in the macroscopic equations 
of motion of the magnetic moments. Thus the magnon spectrum e = fiw(k) is 
also expressed in terms of these parameters. Such a method of defining the 
magnon spectrum is exactly analogous to the definition of the long-wave phonon 
spectrum in terms of macroscopic parameters (elastic moduli) which appear in 
the macroscopic equations of vibrations in sound waves. To carry out this 
programme, we must first derive the relevant equations of motion.’ 

Let us first consider just exchange interactions. Since we are concerned with 
weakly excited states ofa ferromagnet (and only their properties can be elucidated 
in a general form), we must take just the “slow” (low-frequency) motions of the 
magnetic moment. These are movements in which the direction of the magnetic 
moment varies slowly in space, its magnitude remaining constant: the equilibrium 
magnetization is fixed by the exchange interaction, and so its variation must 
involve a finite expenditure of energy at any wavelength (we assume that the 
body is sufficiently far from its Curie point, at which the spontaneous magnet- 
ization becomes zero). On the other hand, the energy is unchanged when the 
magnetic moment of the body as a whole is rotated; hence a rotation of the 
magnetization that is not uniform throughout the body requires an amount of 
energy that decreases with increasing wavelength (i.e. long-wave vibrations 
have a low frequency). The equation for the magnetic moment density (magnet- 
ization) M must therefore be such as to conserve |M !: 


oM/at = 2XM, (69.1) 


t The subsequent results in this section are due to L. D. Landau and E. M. Lifshitz (1935). 
They are valid for “exchange” ferromagnets. We shall not deal here with what is called weak 
ferromagnetism, where the ferromagnetic moment is due only to relativistic interactions. 
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where Qis the angular velocity of precession of the moment, which we shall now 
determine. We write the equation of motion as a differential equation of the 
first order in the time, since at low frequencies the higher derivatives may be 
neglected. 

To determine $2, we must use the fact that, at long wavelengths and low tem- 
peratures, the dissipation of energy when the magnetization varies is small and 
may be neglected; this assumption will be justified at the end of §70. To deter- 
mine the form of the condition for dissipation to be absent, we shall regard the 
magnetic moment of the magnet as an independent parameter whose equilib- 
rium distribution is found by minimizing the free energy. We shall carry out the 
minimization for constant temperature, constant volume of the body, and con- 
stant field H at every point of the body; the thermodynamic potential with re- 
spect to these variables is the free energy F.t The variation ôF for an infinitesi- 
mal change of M may be written 


OF =— | Her. OM dV, (69.2) 


with the notation H,, for the “effective field” by analogy with the expression 
for the energy of the magnetic moment in an external magnetic field. In equi- 
librium, H.g = 0. 

The dissipation of energy when the magnetization varies with time is calcu- 
lated as the derivative 


Q = TOS/dt = —ORmin/Ot = — OF /A1, 


where S is the entropy of the body and Rmin the minimum work needed to 
bring the body into a given non-equilibrium state. From (69.1) we thus have 


Q = f| Her. (M/N) dV = f Her. QXM dV. (69.3) 


Hence it is seen that in the absence of dissipation the vector $2 must be parallel 
to the vector H., so that we can put Q = constant xH,,. Then equation 
(69.1) becomes 

M = constant X Hee XM; (69.4) 


he significance and value of the constant will be ascertained below. 
According to the definition (69.2), the explicit form of the effective field is 
found by varying the total free energy of the body. This energy is 


f= f {JA M) + Unon-u— M » H— H?/87} dV; (69.5) 


see ECM, §36. Here fo(M) is the free energy density of a uniformly magnetized 
body at H = 0, which takes account only of the exchange interactions and is 


t See ECM, §36, where the thermodynamic quantities relating to the body as a whole are 
denoted by script letters. For an inhomogeneous distribution, it is more correct to speak of the 
free energy of the body (for a given volume), not of the thermodynamic potential Ø. We shall 
not be concerned here with striction effects, i.e. stresses and strains that occur in the crystal 
when the magnetization varies. 
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therefore independent of the direction of M; U,,,.y is the additional exchange 
energy density due to the slow change of direction of M through the non-uni- 
formly magnetized body. 

The first terms in the expansion of this energy in powers of the derivatives of 
the moment M with respect to the coordinates are 


1 OM ôM 
Unon-u = 2 Ox, Ox. , (69.6) 


this form (quadratic in the derivatives) is positive-definite. The expression (69.6) 
is constructed so that itis independent of the absolute direction of M (in accord- 
ance with the properties of the exchange interaction). In uniaxial crystals, the 
symmetrical tensor a, of rank two has components «,, = @,, = a1, @,, = a 
(where the z-axis is the axis of symmetry of the crystal); in cubic crystals, 
Oy, = aby. 

The order of magnitude of the coefficients «;, may be estimated by noting 
that the non-uniformity energy relative to the volume of one unit cell in the 
crystal lattice would have to equal the characteristic atomic values of the ex- 
change interaction energy if the direction of the moment varied considerably 
over distances of the order of the lattice constant a. The characteristic exchange 
energy is equal, in order of magnitude, to the Curie temperature T, (the point 
at which the ferromagnetism disappears). From the condition T,/a° ~ aM?/a? 


we find 
a ~ T,/aM?. (69.7) 


Varying the integral (69.5) for given values of H at every point of the body, 
and integrating by parts in the second term, we obtain 


, o’M 


According to the definition (69.2), the expression in the braces is —Hyg. The 
first term is parallel to M, but disappears on substitution in the equation of 
motion (69.4), and may therefore be omitted.’ Thus we find 


o’M 


ETETA +H. (69.8) 


Herr = Qik 


When only the exchange interactions, independent of the direction of the 
magnetic moment, are taken into account, for a uniformly magnetized body 
equation (69.4) must reduce to the equation of motion of a freely precessing 
moment: 





t But then H, need not be zero at equilibrium. 
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where e = —!e| and mare the electron charge and mass, and g the gyromagnet- 
ic ratio of the ferromagnet (cf. Fields,§45). On the other hand, for uniform mag- 
netization H.~ = H; hence it follows that the constant coefficient in (69.4) is 
g| e|/2mc, and the equation of motion is therefore 


Her M, (69.9) 





with H,, from (69.8). 

To form a complete set of equations, we must add Maxwell’s equation relat- 
ing the field H to the distribution of the magnetization M. The spin waves, to 
be discussed in §70, are low-frequency waves in the sense that w « ck. Under 
these conditions, the field is quasi-steady; the time derivatives may be neglected 
in Maxwell’s equations, which become 


curl H =0, divB = div(H+4xM) = 0. (69.10) 


The question may arise here of the legitimacy of varying the integral (69.5) 
with respect to M with H constant despite the fact that they are related by the 
second equation (69.10). The answer is that, if we put H = —v¢@ (from the 
first equation) and calculate the variation of the integral with respect to 4, it is 
zero by the second equation, so that the variation with respect to H makes no 
contribution to 6F. 

If the body is not in an external magnetic field, the field within it is due entirely 
to the distribution of the magnetization, and is in general of the same order as 
M. In this sense, the term H in the effective field (69.8) is a relativistic effect (the 
atomic magnetic moments, and therefore the spontaneous magnetization M, 
are determined by the Bohr magneton B=|e|4/2mc, which has c in the denom- 
inator). In the purely exchange approximation so far considered, therefore, 
the second term in (69.8) is to be omitted, and the equation of motion is 


ôM _ gle OM 


ðr Qme Ox, Oxy ae Se 


This equation is not linear. 
Equation (69.11) may be rewritten as the equation of continuity for the mag- 
netic moment: 











OM; | Olly _ 
ae) Ome > 
where the moment flux tensor is 
_ gjel ðM 
me T (mx Ox ) 


This was to be expected, since in the exchange approximation the total magnetic 
moment of the body is conserved. 


§ 70 Magnons in a Ferromagnet. The Spectrum 289 


Let us now take into account the fact that in a ferromagnet there are not only 
exchange interactions but the considerably weaker relativistic interactions 
(spin-spin and spin-orbit) of the electron angular momenta. In the macroscop- 
ic theory, they are described by the magnetic anisotropy energy, whose den- 
sity Un depends on the direction of the magnetization vector relative to the 
crystal lattice; these interactions determine the equilibrium direction of the 
spontaneous magnetization in the ferromagnet. The relativistic interactions in- 
clude, as already mentioned, that of M with the magnetic field H. 

In a uniaxial crystal, the anisotropy energy has the form 


Un = -4K M2. (69.12) 


If K > 0, the equilibrium magnetization is along the axis of symmetry (the 
z-axis; a ferromagnet of the “easy axis” type); if K < 0, the direction of the 
spontaneous magnetization lies in the xy-plane (a ferromagnet of the “easy 
plane” type). In a cubic crystal, the anisotropy energy may be written 
Uan = (K’/M?)(M2M3+ M3M2+ M3M2), (69.13) 
where the axes x,y,z are along the three fourth-order axes of symmetry (the 
edges of the cubic cells). If K’ > 0, the equilibrium vector M is along one edge 
of the cubic cells; if K’ < 0, it is along one of the spatial diagonals of the cells.* 
We shall consider the particular case of a uniaxial ferromagnet. Adding to the 
integrand in (69.5) the term U,» (69.12), we obtain on variation an additional 
term —KM,v.6M, where v is a unit vector along the axis of symmetry of the 
crystal. Thus we find for the effective field 
Her = Qik ton SMYTH. (69.14) 
It is easily seen that beyond this change in the effective field there is no altera- 
tion of the equation of motion (69.9) due to the inclusion of relativistic effects: 
the neglect of dissipation signifies as before that the right-hand side of the 
equation of motion must be perpendicular to H,,, i.e. must have the form 
M’ XH.,, where M’ can differ from M only by the inclusion of relativistic 
effects, which are always small in comparison with the large quantity M and 
are therefore unimportant. The relativistic terms in H,, are added to a quantity 
that is small because M varies only slowly through the body; these terms may 
become significant at sufficiently long wavelengths. 


§ 70. Magnons in a ferromagnet. The spectrum 


Let us apply the equations derived in §69 to the propagation of waves in 
which the magnetic moment density executes small oscillations, precessing 
about its equilibrium value Mo. We shall consider a one-domain sample with Mo 

t The dimensionless quantities K and K’ for various ferromagnetic substances have values 


in a wide range from severa! tenths to several tens. The order of magnitude of the ratio of the 
relativistic and exchange interactions is given by a°U,,/T7,, which is usually 1074-1075. 
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constant throughout its volume, and take only the case of wavelengths much 
less than the size of the sample. The medium may then be regarded as infinite. 

Let us first take account of the exchange interactions only, i.e. use equation 
(69.11). We put M = Mo-+m, where m is small, and linearize the equation by 
omitting the terms of the second order in m; since M = Mo, in this approxi- 
mation m L Mo. The result is 


————XMo; (70.1) 


here and henceforward we take g = 2. For a coordinate and time dependence 
of m in the form expli(k.r—wt)], we find 


iwm = (|e|/mc) «xk?m XMo, (70.2) 


where « = a(n) = «xħin, and n is the unit vector in the direction of the wave 
vector k. Expanding this equation in components gives 


iwm, = (je|/mc)aMk*m,, 
iwm, = —(|e|/mc) «Mk*m,, 


the z-axis being in the direction of Mo. Hence the dispersion relation for spin 


waves ist 
w = (|e| M/mc) a(n) kè. (70.3) 


We see that, in accordance with the discussion at the beginning of §69, the fre- 
quency in the exchange approximation tends to zero as k — 0. The vector m 
in the spin wave rotates in the xy-plane with constant angular velocity w, 
remaining constant in magnitude. 
In the quantum picture, the formula (70.3) determines the energy spectrum 
of magnons (e = fiw):* 
&(k) = 2BMea(n) k?. (70.4) 


In the second-quantization formalism, the macroscopic quantities describing 
the ferromagnet are replaced by operators expressed in terms of the magnon 
annihilation and creation operators. We shall show how this is to be done for 
the magnons (70.4). 

We assign to the classical quantity M a vector operator M whose compo- 
nents satisfy certain commutation rules. Let S(r) ôV be the operator of the 
total spin of the atoms in a physically infinitesimal volume element ôV at the 
point r. The operators (r1) 6V and S(r.) 6V2 relating to different elements 


t A quadratic dispersion relation for spin waves was first obtained by F. Bloch (1930), 
using a microscopic theory. The expression of this spectrum in terms of macroscopic param- 
eters was given by L. D. Landau and E. M. Lifshitz (1945). 

t In this chapter, $ everywhere denotes the Bohr magneton, f = | e| #/2mc. 
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ôV, and dV, commute, but the components of one and the same operator 
S(x)6V obey the usual commutation rules for angular momentum: 


S, ôV. S, 8V — S, 8V. Sx 6V = iS, ôV, 


or $,5,—S,8,, = i$,/ôV (and similarly for the other commutators). In the 
limit as ôV — 0 these rules can be written for any r, and re 


$.(r1) Sy(t2)— S (r2) 8.41) = iS:(r1) ô(r1—r2). 


Now multiplying this equation by 46? and noting that the magnetization opera- 
tor M = —268, we obtain 


M01) M,(t2)— M,(t2) M(01) = —2i8M(n1) 8(81— r2). (70.5) 


For spin waves in which M executes small oscillations round the z-axis, in 
the first approximation with respect to the small quantities m, and m,, we can 
replace the operator M, by the number M, ~ M; then 


Hixhri) m,(F2)— m,(re) (01) = —2ipMô(rı— r2). (70.6) 


From this it is clear that m, and m, are in this case (apart from constant factors) 
the canonically conjugate “generalized coordinates and momenta”, like @ 
and 0’ in the quantization of sound waves in a liquid (§24). There is, however, 
an essential difference between the two cases. The commutation rule (24.7) for 
the phonon operators is exact, and does not depend on the smallness of the 
oscillations (i.e. the smallness of the phonon state occupation numbers). The 
rule (70.6) is approximate and is valid only in the first approximation with 
respect to the small quantity m. 

From the commutation rule (70.6) and the relation between the operators 
M, and th, that corresponds to the linear equations (70.1), we can find expres- 
sions for these operators in terms of the magnon annihilation and creation 
operators, as was done for phonons in §24 (see §71, Problem 4). 

Let us return to the study of the magnon spectrum and the influence on it of 
relativistic effects. It is now necessary to take account also of the magnetic 
field H that results from the oscillations of M. This will be of the same order of 
smallness as m; it will be denoted here by h. 

Maxwell’s equations (69.10) give 


kxh=0, k.h =—47k.m. 
From this we see that h is parallel to the wave vector, and is 
h =—47(n.m)n. (70.7) 


Substitution of (70.7) in the last two terms of the integrand in (69.5) leads to 
—m .h—/?/82 = 2n(n.m)?*; (70.8) 
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the term Mo.h is here omitted, since on integration over the whole volume it 
becomes a surface integral (owing to the fact that h is a potential field) and 
vanishes. The part (70.8) of the anisotropy energy in a spin wave is sometimes 
called the magnetostatic part. 

Let the ferromagnet be uniaxial and of the “easy axis” type, so that Mo is 
parallel to the axis of symmetry of the crystal (the z-axis): Mo = vM. With 
a view to later applications, we shall allow the presence also of an external 
field H parallel to the same direction v; the sample is to be regarded as a cylinder 
with its axis along v. Then the field within the body is H = H +h. The linear- 
ized equation of motion, which is here written after multiplication by 4, is 

—iem = 2B{(ak?+K+9/M) vXm—v xh}. (70.9) 
For a uniaxial crystal « = «1 sin? 0+«2 cos? 4, where 0 is the angle between 
k and v. 

Substituting h from (70.7), we write out the equation in components; it is 
convenient to take the x-axis in the plane through the directions of v and n. 
From the condition for the resulting two equations for m, and m, to be com- 
patible, we find the dispersion relation 


e(k) = 2BM[(ak?+K+9/M)(ak?+K+/M+ 4a sin? 6)}/2, (10.10) 


Because of the term in sin? 0 = k2/k2, the expansion of e(k) in powers of the 
components of k is not a simple power series. This is due to the long range of 
the magnetic interactions. 

An expression of the form (70.10), which has been derived here for a uniaxial 
ferromagnet (of the “easy axis” type), is valid for cubic crystals. This follows 
because the change in the anisotropy energy for small deviations of the vector 
M from its equilibrium direction has the same form in either case. For example, 
in a cubic crystal with K’ > 0 the change ôU,» when M deviates from the 
direction of Mo along an edge of the cube depends only on the angle # between 
M and Mo, and is 6U,,, = K’M?0?. Comparing this with the corresponding 
expression 6U,,, = +KM?2®? for a uniaxial crystal, we see that, to change to the 
case of a cubic crystal with K’ > 0, we need only replace K by 2K’ in (70.10). 
Similarly, it is easy to see that, to change to the case of a cubic crystal with 
K’ < 0(Mo along a spatial diagonal of the cube), we must replace K by 4| K’|/3. 
Moreover, in a cubic crystal a(n) reduces to a constant. For a uniaxial ferro- 
magnet of the“easy plane” type (K<0), the situation is different: the change 6U,, 
when M deviates from Mo depends both on the polar angle and on the azimuth 
of the direction of M relative to Mo; this case therefore needs special consid- 
eration (see the Problem). 

The result (70.10) relates only to the initial part of the spectrum, in which 
the quasi-momenta k < 1/a and a macroscopic treatment is permissible. At 
large k that still satisfy this condition (ak? >> 42, K) the expression (70.10) 


becomes 
e(k) = 28Ma(n) k?+ 289). (70.11) 
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The first term here is the same as the purely exchange expression (70.4). The 
external field simply adds the term 28 to the magnon energy. In this approxi- 
mation, therefore, the magnon has a moment projection on Mo equal to —2. 
The excitation of each magnon reduces the total magnetic moment of the body 
by 28. 

In the opposite case, as k — 0, the expression (70.10) tends to a non-zero 
quantity, which for H = 0 is 


Le ae 
e(0) = 28MK (tE sin? a) l (70.12) 


Thus the inclusion of the magnetic anisotropy causes an energy gap in the mag- 
non spectrum. (The corresponding frequency w(0) = e(0)/A is called the ferro- 
magnetic resonance frequency.) This is natural, since in the presence of aniso- 
tropy even a rotation of the magnetic moment as a whole (i.e. for k = 0) 
involves a finite energy. We see that for small k the relativistic effects, though 
small, cause relatively large corrections to the spectrum. 

The concept of magnons as elementary excitations applies to weakly excited 
states of the body, and therefore to low temperatures. Hence, in the formulae 
relating to magnons, the values of all the constants of the material (including 
the magnetization M) must be taken for T = 0. 

Let us now return to the assumption made in §69 that the dissipation is weak. 
In the quantum picture, the dissipation means that the lifetime of the magnons 
is finite because they interact with one another and with other quasi-particles. 

If we take first of all the interaction between magnons, we must begin by 
noting that in the exchange approximation the number of magnons is constant: 
each magnon makes the same contribution —28 to M, and the exchange inter- 
action conserves M,. In this approximation, therefore, only scattering processes 
can occur. Their probability, however, decreases with the temperature, simply 
because of the decrease in the number of scatterers, so that the exchange decay 
always tends to zero at T = 0. We shall see later (§72) that a state with one mag- 
non in the exchange approximation is in fact a strictly stationary state of the 
system.' 

At T = 0 the decay of magnons is due only to their disintegration. Such 
processes can occur only because of relativistic interactions, and their proba- 
bility is therefore small. Moreover, for small k the disintegration probability 
is always decreased by the smallness of the statistical weights (phase volumes) 
of the final states of the process. 

The decay of magnons is also caused by their interaction with phonons; 
here the part of the exchange interaction Hamiltonian that depends on the 


t It may also be noted that the cross-section for mutual scattering of two magnons in the 
exchange approximation tends to zero as their energy decreases (see §73). This further reduces 
the exchange decay of magnons at low temperatures. At sufficiently low temperatures, relativ- 
istic effects are important also for scattering processes. 
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deformation of the crystal acts as the perturbation operator. At T = 0 the 
creation of a phonon by a magnon is possible, but the magnon quasi-momen- 
tum must be sufficiently large: the magnon velocity ðe/ñ Ok must exceed the 
velocity of sound (cf. the third footnote to §65). The probability of the process 
is small, because the statistical weight of the final state is small. 

Finally, in a ferromagnetic metal there can always occur excitation by a 
magnon of an electron from below the Fermi surface, because of the exchange 
interaction with conduction electrons. Here again, the probability of the pro- 
cess at small k is small, because the statistical weight of the final states is small. 


PROBLEM 


Find the magnon spectrum in a uniaxial ferromagnet of the “easy plane” type (K < 0). 


SOLUTION. The equilibrium magnetization M, lies in the plane perpendicular to the axis 
of symmetry of the crystal (the z-axis); let its direction be taken as the x-axis. The linearized 
equation of motion of the magnetic moment in this case has the form 


— iem = 2B{ak? n, Xm- | K | mn,—n,Xh}, 


where n, and n, are unit vectors along the coordinate axes, and the vector m lies in the yz-plane, 
which is perpendicular to Mp. Substituting h from (70.7), writing out the equation in compo- 
nents, and equating to zero the determinant of the resulting system, we obtain the magnon 
spectrum 


elk) = 2BM[ak*(ak?-+| K |)+ 47 sin? 6 (ak?+| K | sin? $)”, 
where 0 and @ are the polar angle and the azimuth of the direction of k relative to that of Mo, 


the azimuth being measured from the xz-plane. When «k? >>1, we return to the same quadrat- 
ic spectrum (70.4), and when k — 0 the magnon energy tends to 


€(0) = 4(2| K|)"? BM |sin 6 sin |, 


which is zero when the vector k is in the xz-plane which contains the axis of symmetry and the 
spontaneous magnetization of the crystal. This vanishing is, however, only approximate: the 
inclusion in the anisotropy energy of terms of higher order causes anisotropy in the xy-plane 
and therefore a finite energy gap for all directions of k.t 


§ 71. Magnons in a ferromagnet. Thermodynamic quantities 


Magnons excited in a ferromagnet make a certain contribution to the ther- 
modynamic quantities in it. The results obtained in §70 enable us to calculate 
this contribution at temperatures that are low in the sense that T < T,. For, 
in thermal equilibrium at a temperature T, most of the magnons have energy 
€ ~ T. For a quadratic spectrum 


e(k) = 28Mea(n) k2, (71.1) 


t It may be recalled (see ECM, §37), that the expansion of the anisotropy energy in powers 
of M is really an expansion in terms of the relativistic ratio v/c (and is not due to the smallness 
of M, i.e. to the closeness of the Curie point). 
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this means that at temperatures T << T, magnons are excited with quasi-mo- 
menta k «(T,/BMa)'”. Using the estimate (69.7) for « and estimating the mag- 
netization as M ~ 8/a? (the magnetic moment per unit cell is of the order of 
a few times £), we find ak « 1, i.e. the condition for the results of §70 to be 
valid. 

The “magnon” parts of the thermodynamic quantities in a ferromagnet are 
calculated as the thermodynamic quantities in an ideal Bose gas with zero 
chemical potential. For example, the magnon part of the thermodynamic po- 
tential Q is 


Qmag = T f log (1—e-"!7) V dk /(27)8; (71.2) 
see Part 1, (54.4). The magnon contribution to the internal energy is then‘ 
Emag = Qmag—T OQmag/OT 


€ V dk 
-Í eTii (a ` Cha 


The magnon contribution to the spontaneous magnetization determines its 
temperature dependence. It is calculated as the derivative 


Mass = M(T)-M0) = -5 | = | RE 





V| a 


with respect to the external magnetic field; cf. ECM, (31.4). Differentiating the 
expression (71.2), we obtain 


ôe 1 dk 
Mau =- Í al PT Ga (11.4) 


The derivative —ðes/ðH is the intrinsic magnetic moment of the magnon. 

Let us calculate the integrals (71.3) and (71.4) for temperatures T >> 2%8M;;* 
then the limiting expression (71.1) may be used for the magnon spectrum. In 
view of the rapid convergence of the integrals, the integration may be extended 
over all k-space instead of over one reciprocal lattice cell. Taking « as constant 
(for cubic crystals) and replacing d°k by 4k? dk, we get after an obvious 
substitution 


VT f ldx 
4n? Ase | e—1 
ò 


Emg = 


= VTT (5/2) 0(5/2)/4n2A3?2, 


t With the chemical potential u = 0O (and therefore also Ø = Nu = 0) we have 
E = Ø+TS—PV = TS+2Q; the entropy S = —3N/ðT. The expression (71.3) could also, of 
course, be written down directly without using (71.2). 

+ For the typical value M = 2 103 G, this condition gives T >> 1°K. 
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where for brevity A denotes 26M« (so that e = Ak?).t For the specific heat 
C mag = OE ma,/OT, we hence find 
51°(5/2) (5/2) 
C mag = V -— gag T 3/2 


= 0.113(T/A)9? V. (11.5) 


This expression gives only the magnon part of the specific heat; the specific 
heat of the crystal also contains the usual phonon part. 

Returning to the integral (71.4), we substitute the value — 2 for the magnetic 
moment of the magnon according to (70.11). The result is, for T = 278M, 


__ pr? P xil? dx 
272 A312 ex—1 ’ 
ò 


Mmag = (71.6) 


whence 
M(T) = M(0)—T*? T (3/2) £(3/2)/2n2A3!? 


the magnon contribution is, of course, the whole of the change in the magneti- 
zation, since the phonons carry no magnetic moment. Thus the change in the 
spontaneous magnetization in the temperature range 278M < T « T, follows 
a T°” law (F. Bloch 1930). 

The presence of the gap (70.10) in the magnon spectrum leads to an expo- 
nential dependence of C mag and M,,,, on T at still lower temperatures: when 
T < BKM, 

Cmag, Mmag £ exp (—28KM/t). (71.8) 


The quantity in the numerator of the exponent is the least value of the energy 
gap, reached at 0 = 0 and 0 = x; see also Problem 1. 

If the spontaneous magnetization of a ferromagnet in the ground state is 
equal to the greatest possible (nominal) value, corresponding to the case where 
all the atomic moments in the body are parallel, this value is unchanged by the 
application of an external magnetic field in that direction, i.e. the susceptibility 
x in that direction is zero. 

The inclusion of relativistic interactions reduces the spontaneous magnet- 
ization (at T = 0) in comparison with its “exchange” value, and causes the 
appearance of a non-zero susceptibility (T. Holstein and H. Primakoff 1940). 
Although this effect is very small, its calculation is of fundamental interest. 

In the foregoing calculation of the magnetic part of the thermodynamic quan- 
tities, we have omitted the zero-point energy of the “magnetic oscillators”, 
which makes no contribution to the temperature dependence of these quantities. 
The zero-point energy corresponds to magnon state occupation numbers of 1: 


[LEO)lmag = f 3 Elk) V dk/2n). 


t See Part 1, §58, regarding the calculation of such integrals. 
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Accordingly, the “zero-point” magnetization is 
M(0) = — f 4 (@e/0%) d?k/(2x)°. (71.9) 


This integral diverges for large k, i.e. is determined mainly by the short-wave- 
length magnons (ka ~ 1), which cannot be treated macroscopically. However, 
the change in the magnetization under the influence of relativistic effects is 
determined, as we shall see, by the long-wave region of the magnon spectrum, 
and can be calculated by means of the formulae derived in §70. 

For simplicity, we shall consider a cubic crystal, and neglect the anisotropy 
constant, which is small in that case, i.e. write the magnon spectrum (70.10) 


in the form 
e(k) = 26[(bk?+ )(bk?+ + 40M sin? 6)}4/2, (71.10) 


where b = aM; in this expression, the term 47M sin? 0 coming from the mag- 
netostatic energy corresponds to the relativistic effects. The required change 6M 
in the magnetization under the influence of relativistic effects is found by sub- 
tracting from (71.9) a similar integral with ¢,,.4.(k) = 28bk?+28 in place of 
e(k): 


M=- > f ma [e(k) — eexs(k)] Er (71.11) 


This integral is convergent for large k.* 

In the calculation, it is convenient first to differentiate with respect to M at 
constant b, and the notation b has been used in (71.10) for this purpose. A simple 
rearrangement gives 

ðM _— 4n* BM sint 6 . 27k? dk.sin 0 d0 
ôM (Or | | (bk? + H} (bk?+ §+42M sin? 6)5!2 ` 
0 0 


Since the integration with respect to k converges, it can be extended to æ. 
The integral with = 0 is easily evaluated; then, integrating with respect 


to M, we obtain 
8M =—+/x B/803!2. (71.12) 


This is a very small quantity: 6M/M ~ 1078. 
If the external field is large (H >> 4M), we can neglect the term 42M sin? 0 
in the denominator of the integrand. The result is then 


6M = — 22 BM 2/1503/2 g. (71.13) 

When § — œ, ôM tends to zero, as it should. 
t To avoid misunderstanding, we may note that the correction to the ground-state energy 
cannot be determined in this way: without the differentiation with respect to §, the integral 


of e—&..., is divergent when the long-wavelength expressions for the magnon spectrum are 
used. 
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To conclude, we may note that, if we had attempted to investigate the tem- 
perature dependence of the magnetization in a two-dimensional ferromagnet by 
the same method as has been used in this section for the three-dimensional case, 
then (in the purely exchange approximation) we should have obtained instead 
of (71.6) a logarithmically divergent integral. This means that the spontaneous 
magnetization in a two-dimensional system with exchange interaction is ac- 
tually zero for all T = 0. This is analogous to the result found in §27 for a two- 
dimensional Bose liquid, and in Part 1, §137, for a two-dimensional crystal. 
Since the energy of the system is independent of the direction of the magnetic 
moment, the expression for it contains only the derivatives of the vector M; 
this in turn leads ultimately to the divergence of the fluctuations (in the two- 
dimensional case) that destroy the magnetization. The inclusion of the relativ- 
istic interactions, which depend on the direction of M, stabilizes the fluctua- 
tions and makes possible the existence of a two-dimensional ferromagnet. 


PROBLEMS 


PROBLEM 1. Calculate the magnon parts of the thermodynamic quantities at temperatures 
T < e(0). 


SOLUTION. The important magnons are those with small quasi-momenta k propagated in 
a direction where the gap is least, i.e. near 0 = O or 0 = 2; these two directions give equal 
contributions. For example, when 8 is small we have, with the necessary accuracy, 


e(k) = 28KM + Ak? +-4BM0?, 


where A = 2ßMa for cubic crystals and A = 26Ma, for uniaxial crystals of the “easy axis” 
type. The magnon distribution at these temperatures may be taken as a Boltzmann distribu- 
tion (i.e. we may neglect unity in the denominators of the integrands) and e(k) may be replaced 
by (0) everywhere in the coefficients of the exponentials. Integrations with respect to k 
and 6 are extended to infinity; the result is 


KT®? ( 2BKM 


Fuse = Vagn P \- 


E, 
e Man =F 


In calculating the specific heat, only the exponential factor need be differentiated : 
Cmag = (2BKM/T*) Emag- 


PROBLEM 2. Determine the dependence of the magnetization on the external! field with the 
conditions § >> 42M, T >> BS. 


SOLUTION. Under these conditions, the relativistic terms may be neglected, and e(k) written 
in the form (70.11). Differentiating the expression (71.4), we find 


ðM = 42 etl? d®k 


0H (e/7 1)? (27)8 s 


The important values of k in the integral are small. Hence 


ak k? dk 
on rf E Qa) =f (ek? Mo +H)? ; 
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we take a = constant, and M, is the value of M for Q = 0. The final result is 


0M = T 
Fs) 8n(aM, Po * 


Thus, under these conditions, M— My œ $". 


PROBLEM 3. Determine the dependence of the magnetization at 7 = 0 on a weak external 
field. 


SoLuTION. Differentiating the integral (71.11), with e(k) from (71.10), with respect to Q, 
we obtain 


ðM _ 4n®M 26 sin‘ 0 dk 
“8D J [EMK +4nM, sin? 6+ §) OM k+ (272)? ` 


As 9 — 0, the integral over dk diverges logarithmically for small k. Hence, with logarithmic 
accuracy, we can put k = 0, § = 0 in the first factor in the denominator, and § = 0 in the 
second factor, but cut off the integral below at k? ~ §/aM, and above at k? ~ 4n/a. The 
result is 





OM _ — Bb lo 4nM, 
® 32 yaM a Ss: 


In (71.10), K is neglected. When § « KM, § is replaced by KM, in the logarithm. 


PROBLEM 4. In the exchange approximation, determine the spatial correlation function of 
the fluctuations in the magnetization at distances r >> a. 


SOLUTION. The operators m, and M, satisfying the commutation rule (70.6) and expressed 
in terms of the magnon annihilation and creation operators have the form (in the Schrödinger 


representation) 
ialt) = (BM/V)¥? 2 (G, e: -At e7i t), 
k 


Mult) = i(BM/V)!? Y (Gy eft -1 de e- +1), 
k 


With these operators, we calculate the correlation function 
Plt) = F (Pult) lle) + Al) Palri), r= ri— T3, 


the suffixes i and k taking the values x and y. Since only the products (tâ = m and 
(kát Y = n+ 1 (where n are the magnon state occupation numbers) have non-zero diagonal 
matrix elements, we find 


Palt) = ôa | 28M (m4) e:t d?k/(27)%. 


The integrand gives directly the Fourier component of the correlation function. The constant 
term may be omitted: it corresponds to a delta-function term in $,(r), whereas the whole 
analysis relates only to distances r > a. Thus 


Palk) = 2BMmo, = 2BM[ee™/7— 1]! dy. 
In the classical limit, when £ « 7, we find 
alk) = 5y7/ak*. 
In a cubic ferromagnet, « = constant, and 


Out) = SuT/4xar, rs» (BMa/T)'/. 
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§ 72. The spin Hamiltonian 


To derive the magnon dispersion relation for all values of the quasi-momen- 
tum, and not only in the long-wave limit, we must naturally make use of more 
detailed information about the microscopic structure of a ferromagnet. 

Let us consider an insulator consisting of atoms with zero orbital angular 
momentum but non-zero spin S. If we are not concerned with highly excited 
states resulting from excitation of the electron shells of the atoms, we can 
average the Hamiltonian of the system over the orbital variables of the electrons 
of atoms in the ground state (with the atomic ruclei fixed at the lattice sites). 
This gives the spin Hamiltonian of the system, containing only the operators 
of the total spins of the atoms.‘ 

If we take into account just the exchange interaction, which depends only 
on the relative orientations of the spins, the operators of the atomic spin 
vectors must appear in the Hamiltonian as scalar combinations. It is of con- 
siderable methodological interest to investigate a system described by the 
simple Hamiltonian 


exch = —+ > JanS uas Jam = J(ta— tm), (72.1) 


m+n 


where the summation is over all atoms; the “vector” suffixes m and n (with 
integral components) label the lattice sites, and r, are their position vectors. 
The numbers J,,, are called exchange integrals; cf. QM, §62, Problems.* With 
independent summation over m and n, each pair of atoms appears twice in 
the sum (72.1), and of course J m = Jmn 

In (72.1), all the magnetic atoms in the lattice are assumed to be the same 
(one in each unit cell). The basic assumption underlying this Hamiltonian is, 
however, that the atoms in the lattice are sufficiently far apart. The exchange 
integral is determined by the “overlap” of the wave functions of two atoms, and 
decreases very rapidly (exponentially) as the distance between them increases. 
For a system of atoms far apart, we can therefore assume a pair interaction, 
and (72.1) then contains no terms with products of spin operators of more than 
two atoms. To the same accuracy, we may suppose that the exchange inter- 
action between two atoms comes about through a single pair of electrons, one 
in each atom. Then the interaction operator is formed bilinearly from the 
electron spin operators, and after averaging over the states of the atoms, biline- 
arly with respect to the atomic spins (C. Herring 1966).° 

t The procedure is similar to the construction of the Hamiltonians of individual atoms 
describing the fine structure of the atomic levels; cf. QM, §72. 

+ The description of the exchange interaction by means of the spin Hamiltonian is due to 
P. A. M. Dirac (1929). The Hamiltonian (72.1) was introduced by J. H. van Vleck (1931), and 
is usually called the Heisenberg Hamiltonian, because it corresponds to a ferromagnet model 
first discussed by Heisenberg. 

$ In such conditions, the summation in (72.1) must of course be taken only over pairs of 


adjacent atoms. This, however, does not simplify the formulae, and it will therefore not be 
shown explicitly. 
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A system described by the Hamiltonian (72.1) is ferromagnetic if the ex- 
change integrals J,,, > 0. Let us determine the ground-state energy of such a 
system, and suppose that an external magnetic field § is also present, by adding 
to (72.1) the operator 


the z-axis being parallel to the field. The operator ZS: of the projection of the 
total spin of the system commutes both with H,,., and with P; the states of 
the system may therefore be classified by the eigenvalues of this quantity. 

In the ferromagnetic case, the ground state corresponds to the greatest 
possible value NS of the total-spin projection, where N is the number of atoms 
in the system; this result is, of course, independent of the presence of an ex- 
ternal field, which simply distinguishes the direction of the axis. 

Let yo be the normalized spin wave function of the ground state. The maxi- 
mum value NS of the total-spin projection can be reached only if the spin 
projection of every atom also has its maximum value S. Hence yo is an eigen- 
function of each of the operators S,,: 


SazXo = Sxo. (72.3) 


In the following, we shall need the operators S. = $,+i8,, which obey the 
commutation rules 


§,8_—S8_S8, =28,, SS- SS: =s; (72.4) 


see OM, (26.12). Their matrix elements are 
(S21 S4] S.- 1) = (S,—1|S_] Sa = V/[(S+S,)(S—S:+1)]; (72.5) 


see QM, (27.12). The operator $, increases by one the value of the component 
S,, and S_ decreases it by one. We also write 


Sado = SmazSar tH Smt Sa- + Sm—Sa+) 


and 


Ê =—}4 Ý Jrn(SmsSaz+ $m—Sa+)— 289 >, Sms, (72.6) 


ma 
using the symmetry Jmn = Jam and the commutativity of operators pertaining 
to different atoms. 
Since the operators §,, have matrix elements only for transitions with in- 
crease of the numbers S,, for the state with the maximum values of these 


numbers we have 
Sa+%o = 0, (72.7) 


as is seen also from the explicit expressions for the matrix elements (72.5). 
Hence, when the Hamiltonian (72.6) acts on the wave function yo, the result is. 


Ayo = {-3 „A, Jaa? 2BGNS} Zo. 
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The expression in the braces is the energy Eo of the ground state. Replacing 
the summation over m and n by summation over m and q = n—m, we can 


finally write 
ax 0 


The total magnetic moment of the system in this state is 28SN. 

The next state of the system, in order of decreasing total-spin projection, 
corresponds to the value NS—1 of the latter, and to the excitation of one mag- 
non with magnetic moment — 28. This value of the total-spin projection occurs 
for a state with wave function 


(2S) 126 n—X0> (72.9) 


in which the effect of the operator §,_ reduces by one the spin projection of 
one atom.! This function is, however, not an eigenfunction of the Hamiltonian 
of the system; it does not yet take account of the translational symmetry of the 
lattice. The eigenfunction of the Hamiltonian must be constructed as a linear 
combination of the functions (72.9) with all n. Similar arguments to those which 
in §55 gave the Bloch functions for an electron in a periodic field show that, 
to take correct account of the translational symmetry, this linear combination 
must have the form 

Zk = (2NS)-¥? F efk- Sy_xo; (72.10) 

a 


the factor N- is for normalization. The constant vector k is just the quasi- 
momentum of the magnon. 

The energy e(k) of the magnon is the difference E,,— Eo between the energies 
of the excited and ground states of the system. Hence 


(AE) %k = e(K) Yu. 


Substituting on the left-hand side the expression (72.10) and replacing Eoyo 
by Ayo, we obtain 


&(k) yk = (2NS)-¥2¥ efk- *0(S,_ — Sa) xo. (72.11) 


The commutator here is easily calculated by writing in the form (72.6) and 
using the commutation rules (72.4). Again taking into account the symmetry 
of the coefficients J,,,, we find 


AS,—— Sa = Y’ Ian(Smz5a—— $o25m—)+2 b9sa. (72.12) 


t The normalization of the function (72.9) is easily verified by noting that 
(Sa-%o)* (Sa~%o) = x3S83Sa-Xo 
= (S | Sa+Sn- | S) 
= (S| Sat] S- 1X(S—1 | Sa- | SY 
= 28. 


§ 72 The Spin Hamiltonian 303 


Finally, substituting this expression in (72.11), using (72.3), and changing to 
summation over q = n—m, we have 


e(k) yk = [S 2, mey Xk- 


The expression in the braces is the required magnon energy. The imaginary 
part of the expression under the summation sign, being an odd function of 
Fy gives zero on summation, and the result is 


ek) = S X Jal—cos k.n) +266 (12.13) 


(F. Bloch 1930). 

This formula gives the exact dispersion relation for magnons in a system 
described by the Hamiltonian (72.1). In the limit of small k, it becomes, of 
course, the quadratic relation 


e(k) = Skikk Y Jerar t 28H. (12.14) 
az 


The Curie point of the system in question is at a temperature T, ~ J, and 
so at temperatures T >> J the system is certainly paramagnetic. At such tem- 
peratures we can, as a first approximation, altogether neglect the interaction 
between the atoms. In this approximation, the magnetic susceptibility of the 
system will be the same as that of an ideal gas of atoms with spin S, namely 
_N 48S(S+1) 


V 3T 


(see Part 1, §52); the susceptibility is per unit volume. This expression is the 
first term in the expansion of the function y(T) in powers of 1/7. The subsequent 
terms do depend on the interaction of the atoms; we shall determine the first 
such term. 

The susceptibility (in a zero field) is defined as the derivative y = 0M/O 
as © — 0, and the magnetization M is calculated as the derivative of the free 
energy: VM = —OF/0Q%. To solve the problem, we need to calculate F as 
far as the terms in 1/T?. A 

We start from the formula F = —T log Z, where Z is the partition function 


Z = }ie-FwT 


(72.15) 


TE 

tee T° T 6T?’ 
the summation is over all the energy levels of the system.’ The total number of 
levels in the spectrum of the system is finite, and equal to the number of possible 


t The following calculation of the free energy corresponds to that in Part 1,§73, continued 
to the next term in the expansion. 
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combinations of orientations of the atomic spins relative to the lattice. Each 
spin has 2S + 1 different orientations; this number is therefore (2S + 1)”. Denoting 
by a bar the arithmetic mean, we can write 
= (28+ 1)" [1p B+ ope E- : ar). 
T 2T? 6T? 

The mean value Æ" = tr A”/(2S+1)". According to a well-known prop- 
erty, the trace of the operator can be calculated with any complete set of 
wave functions; we take as these the functions corresponding to all possible 
combinations of orientations of the atomic spins. Then the averaging reduces 
to an independent averaging of each spin over its directions, and E = 0. Now 
taking the logarithm of Z and again expanding in powers of 1/T, we have to 
the same accuracy 
1 


ar? ES. (72.16) 





= —TN log S+- E+ 


In this expression, only the terms containing $? contribute to the suscepti- 
bility. Omitting ali other terms, and noting that the odd powers of the spin 
components give zero on averaging, we find 


r=- y g- CEE FY, Daal S n) (SaS). 





The mean values are 
SuSa = s= 0, SE=15(8+0). 
Thus 
F = S BEINGS +1)- 9 a Orr PO ONS(S+1P ¥ Ja 


ao 
and hence we have finally for the susceptibility 


_ 4BPS(S +1)N [i+ +56) 1) 


i yJ a|: (12.17) 


q0 


It should be noted that the sign of the correction term in the square brackets 
depends on the sign of the exchange integral. 


PROBLEMS 


PROBLEM 1. Find the magnetic part of the specific heat of a system described by the Hamil- 
tonian (72.1) at temperatures T >> J. 


SOLUTION. The first term in the expansion of the specific heat in powers of 1/7 comes from 
the term — E?/2T in the free energy (72.16). Averaging by the same method the square of the 
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Hamiltonian (72.1), we get 
E? = 4.2 2 Jia SmsSmy SnoSat 
m n 


_ 4 SUSHI 1 
=3. ND 


since S,S; = 4S(S+ 1) ĉa The specific heat is then 


2 
NSX(S+1)? De J, 


Coss = — G7 


in agreement with Part 1, (73.4). 


PROBLEM 2. Neglecting the interaction between spins, calculate the magnetization of a 
paramagnet with any relation between 8 and T. 


SOLUTION. The partition function (for one spin in the field) is 
8 
Z= $ > : exp (— 28 9S,/T) 


_ _sinh 28G(S+)/T 
sinh BQ/T ' 


A calculation of the free energy, followed by differentiation with respect to ©, gives the magnet- 
ization 
N 
M = — T log Z 


-A (s anA D 4 f8) 


{L. Brillouin 1927). When BQ <« T, this expression becomes (72.15). In the opposite limit, 
when BQ >» T, the magnetization tends to its nominal value according to 


M= ENS {1—exp(-268)} š 


§ 73. Interaction of magnons 


There is considerable methodological interest in the problem of the contribu- 
tion from magnon interaction to the magnetic part of the thermodynamic 
quantities in a ferromagnet. The calculations in §71 were based on an ideal 
gas of non-interacting magnons. Let us consider this problem for a system 
described by the exchange spin Hamiltonian (72.1). 

Having in view the determination of only the contribution of the lowest 
order in the small ratio T/T,, we need only consider the pair interaction of 
magnons. This means that we have to consider two-magnon states of the system 
in which the total-spin projection is NS—2. 

This corresponds to the wave functions 


= [4S(2S—1)]-¥? Sa- Sa—Xo, | 


Xman = (2S)-1 § _Sa—Xo, m ~n; (73.1) 
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since the spin operators of different atoms commute, yn, = Xam- The func- 
tions (73.1) are normalized by the condition yna%mn = 1, as is easily seen by 
expanding the product in the same way as was done in verifying the normaliza- 
tion in (72.9). The same procedure can be used to show that different functions 
Zman are mutually orthogonal. 

The functions (73.1) are not themselves eigenfunctions of the Hamiltonian. 
The wave functions of the two-magnon stationary states of the system must be 
certain linear combinations of the functions ,,,, which we write as 


1 = a. g Paniton + Y, Yanan (73.2) 


(since Yun and Yam are the same, we must also take Yman = Yam) The set of 
coefficients Yma forms the wave function in the representation in which the 
independent variables are the numbers labelling the atoms in the lattice. The 
factor 1/4/2 in the first sum in (73.2) is included so that the squared modulus 
‚gl? shall be equal to the sum 2'y,,|?, in which each of the different pua 
appears once only. 

The same method as was used to derive equation (72.11) for the wave func- 
tions of one-magnon stationary states shows that the functions (73.2) must 
satisfy a similar equation: 


Cn = p? Era ~~ (A, SmS] Xo 


p7 WsOs FF [2, Sa-Sa-] Xo, (73.3) 
where now & = E—Ep is the energy of the two interacting magnons, and the 
brackets [ ] denote the commutator. 

To expand the commutators on the right of (73.3), we note that 


[A, Sa—So—] = (7, Sa] Sa- + Sm-lh, Sa] 


and use the expressions (72.12) for the commutators [#, $,_]. Then, taking 
account of the commutation rules (72.4), we transpose the operators §, to the 
extreme right-hand position, where, acting on the function yọ they multiply 
it by S. The result is 


IA, Sa—Sa—] xo = S E a(S ~— Ñi-) Sa-+J a(S- — Si) Sm—} Yo 
+ mn y Jas Si- Zo— J moSm—Sn—10+4 BOS m—Sa—Yo; (73.4) 
I 


to simplify the formulae, the limitations imposed on the summation variables 
are not shown. The summations are over all values of J, it being implied that 
all the “diagonal” Jy = 0. 


t If the spin S = 3, a twice repeated application of the same operator Sa to the ground- 
state function 7%» gives zero. In this case, therefore, all the “diagonal” functions y,, = 0. 
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The remaining steps are to substitute (73.4) in (73.3) and equate the coeffi- 
cients of like functions y,,, on both sides of the equation. The calculations are 
laborious but elementary. They lead to the following set of equations for the 
Yon: 

(2JS— E) Ym = S È (Jimia + JinYim) + J mapun 
I 


— As kK ma(Ymm + Ya) + 26mn 2 J mim] > (73.5) 


wns)" 


and J denotes the sum Ja, which is obviously independent of the suffix n.t 
I 


where 





In this equation, let us change from the coordinate representation (in which 
the independent variables are the coordinates r,,r,, of the atoms) to the mo- 
mentum representation, putting 


= i iK. (+r)? ik. (tnaa) 
Yan = ae XK, k)e (73.6) 


The vector K plays the role of the total quasi-momentum of the two magnons, 
and k that of the quasi-momentum of their relative motion; the summation is 
over N discrete values of k admissible for a lattice with volume Nv (where N 
is the number of atoms in the lattice and v the volume of the unit cell). As well 
as the Yn the exchange integrals also have to be represented in the form of 
Fourier series: 


Jon = 7 yk mw) J(k), 
r (73.7) 
J(k) = > Jon elk. (To—ra) ` 


since Jan = Jam We have J(k) = J(—k). 
Omitting the straightforward intermediate steps, we shall pass immediately 
to the final result derived from (73.5): 


[e(}K+k)+e($K-k)—2] w(K, k) 


+f UŒ, k, k’) w(K, k’) V d?k' (2x)? = 0, (73.8) 
where 
NU(K, k, k’) = As[J(4.K+k)+J4K—k)+JG¢K+k’)+J($K-k)] 
-lJ (k-k')+J(k+k’)], (2.3) 


t These equations are valid also for spin S = 3, when all the y,, are arbitrary. It should be 
noted that for S = 3 all the “diagonal” quantities Ya» disappear from the equations with 


m = n. The equations with m = n in this case are simply to be regarded as non-existent. 
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and e(k) is the energy of one magnon, determined by (72.13); the summation 
over k’ is replaced by integration over one reciprocal lattice cell. 

Thus the exact problem (for the Hamiltonian (72.1)) of the two-magnon 
states of the system reduces to the solution of an equation exactly similar to 
Schrédinger’s equation for a system of two particles in the momentum repre- 
sentation; cf. QM, (130.4). The functions e(k) play the part of the kinetic 
energies of the particles, and the kernel U(K, k, k’) of the integral equation 
that of the matrix element of the energy U of their interaction for a transition 
(scattering) from states with momenta ki, ke to states with momenta k,, k,, 
where 


kı = 4K+k, k,=35K—-k, ki=4K+k’, kp = +K-k’. 


Then U(K, k, k’) is suitably written in the form 


NU(Ki, ko; ki, k2) = As[J(k1) + J(ke)+ J(ki)-+ J(k2)] 
— #1 (ki— ki) + J(k1—ks)]. (73.10) 


In the general case, equation (73.8) with (73.9) is very complicated. We shall 
calculate only the correction to the thermodynamic quantities when S > 1. 
The simplicity of this case is due to the fact that the magnon energy e(k) is 
proportional to S, and the interaction U of the magnons is independent of S; 
for S > 1, the coefficient in (73.9) is Ay ~ +. Hence U may be regarded as a 
small perturbation, and the correction 2,,, (from the magnon interaction) to 
the thermodynamic potential Q is given by just the mean value of U. By taking 
the “diagonal matrix element” 


U(k1, k3; kı, ko) = oT [J(k1)+ J(Ke)— J(ki-— ke)— JO)I, (73.1 1) 


we average over a State with given magnon quasi-momenta. The statistical 
averaging over the equilibrium distribution of magnons is then carried out by 
integration: 

V2 d3k1 d>ke 


ayer (73.12) 


Qint = fra n(k2) U(kı, ke; Kı, ke) 
where n(k) = [exp (e(k)/T)— 1]? is the Bose distribution function. 

At low temperatures, the integral is governed by the range of small k, and 
kz, and accordingly all the e(k) and J(k) are to be expanded in powers of k. 
Then e(k) is given by the quadratic expression (72.14). Since J(k) is an even 
function of k, the first terms in its expansion are also quadratic: 

J(k) ~ J(0)+ ankike. 
Then 


1 
U(ki, k2; ki, ke) = W dink rik 2x. 
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When this expression, an odd function of k; and ko, is substituted in (73.12), 
the integral is zero as a result of the averaging over the directions of k, and ke. 
In the expansion of J(k), therefore, we have to take into account the fourth- 
order terms, and then in the integral (73.12) the function U(k1, k2; kı, Ke) is a 
quartic form, in which the terms quadratic in k, and in ką make a non-zero 
contribution to the integral. Because of the rapid convergence, the integration 
may be extended over all k-space. We then see, by making the change of 
variables k = k+/T, that the dependence of Q, on T and § has the form 


Rint = VT f(Q/T), (73.13) 


f(0) and f’(0) being finite. Hence it follows that the correction term in the 
magnetization is 





Min = -7 0H 
The correction term in the specific heat has the same dependence.’ 

We see that the interaction of magnons leads to corrections to the thermo- 
dynamic quantities only in a higher approximation with respect to T/T. The 
leading terms in the magnetization and in the magnetic part of the specific 
heat are proportional to T32. Between these and the corrections arising from 
Q; there are others proportional to T°? and T’”, which come from the next 
terms in the expansion of the magnon energy e(k) in powers of k?. 

By means of the equations derived above, we can also consider the question 
of bound states of two magnons. These states appear as discrete (for a given K) 
eigenvalues of equation (73.8). As functions of the variable K, these eigenvalues 
&(K) form new branches of elementary excitations in the system. Analysis 
shows, however, that such states exist only for fairly large values of K, and 
they can therefore never affect the thermodynamic quantities in a ferromagnet 
at low temperatures.’ 


1 l E = constant XT“. (73.14) 
§ =0 


PROBLEM 


Assuming that S >> 1, find the correction terms due to the magnon interaction in the mag- 
netization and the specific heat for a cubic lattice in which the exchange integrals are zero 
except for pairs of atoms that are adjacent (along cubic axes). 


SoLuTion. Each atom has six nearest neighbours. From the definition (73.7) we find 


J(k) = 2J,(cos k,a+cos kya+cos k,a), 


where Jp is the exchange integral for a pair of adjacent atoms, and a the edge length of a cubic 


cell. For small k, 
Jk) © Jo[2— a?k? + peat(k4+ kt+ k4)]. 


t These results (in the general case of arbitrary spin) were first obtained by F. J. Dyson 
(1956). In the derivation of (73.5) given here, we have mainly followed R. J. Boyd and J. 
Callaway (1965). 

+ See M. Wortis, Physical Review 132, 85, 1963. The lattice concerned is three-dimensional. 
Tn one and two dimensions, bound states of magnons exist for all values of k. 
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Hence 
a’, 0 
4V 


the terms that are odd functions of k, and k, are omitted. The magnon energy is, according 
to (72.14), 
elk) = SJga?k? +260. 
The calculation of the integral (73.12) gives the results 
Mint 326(3/2) (5/2) ( T 3 


ee err ee ff 





U(ky, kg; ky, k2) = — (kizkłe + kiykžy + kī:kž:); 





C = 1526/2) N ( T )' 
ee S AnS, /]’ 


where denotes the zeta function. 


§ 74. Magnons in an antiferromagnet 


Antiferromagnets have the characteristic that the magnetic moments of all 
the electrons in each unit cell in the crystal lattice compensate one another 
(in the equilibrium state in the absence of a magnetic field). The magnetic 
moment density is, strictly speaking, distributed throughout the volume of the 
cell. However, in antiferromagnetic insulator crystals, we can suppose with good 
accuracy that this density is practically localized at the individual atoms, to 
each of which a certain magnetic moment may be assigned. These moments, 
periodically repeated in all cells, form the magnetic sub-lattices of the antiferro- 
magnet. 

Different antiferromagnets vary greatly in structure. We shall discuss the 
question of their magnetic energy spectrum for the typical example of a crystal 
with two magnetic atoms at equivalent points in each unit cell (i.e. at points 
that are changed into each other by some symmetry transformations of the 
crystal). The mean magnetic moment densities of the sub-lattices formed by 
these atoms are denoted by Mı and Mg, and we shall use the two vectors 


M=Mi+Me, L = Mı—M2. (14.1) 


In the ground state of the antiferromagnet M = 0 and L + 0, whereas for 
a ferromagnet M = 0 and L = 0. We must emphasize a fundamental difference 
between the ground states in the two cases. In the exchange approximation, 
in the ground state of the ferromagnet, the spin components of all the magnetic 
atoms have definite (the largest possible) values S, = S, corresponding to the 
nominal value of the magnetization M. In the ground state of the antiferro- 
magnet, the magnetizations of the sub-lattices certainly cannot have their 
nominal values, since the total projections of the spins of each sub-lattice 
separately are not conserved (even in the exchange approximation), and there- 
fore do not have definite values (in a stationary state). The spin components 
of the individual atoms therefore also do not have definite values. 
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The form of the macroscopic “equations of motion” of the vectors L and M 
is established similarly to that for a ferromagnet (§69). The condition for ab- 
sence of dissipation leads to the requirement that, because of the equations of 
motion, 


ôL 0M), _ 
-—=- = [fa te She =0, (74.2) 


where the “effective fields” H, and H are determined by the expression 


ôF =—{ (H1 .ôL+H m . 6M) dV (74.3) 


for the change in the free energy when L and M are varied; in equilibrium 
H, = Hy = 0. 

In the exchange approximation, the required equations must be invariant 
under simultaneous rotation of all the magnetic moments relative to the crystal 
lattice. This implies not only the crystallographic equivalence of the positions 
of the two magnetic atoms in the cell but also the invariance under exchange of 
Mı and Mbp, i.e. under the transformation L ~ —L, M — M. Because the 
free energy is unaffected by this transformation, we have also H, ~—H,, 
Hy + Hy. 

Considering small oscillations of the magnetic moments, we put L = Lo+] 
M = m, where | and m are small quantities. In the linear approximation, 
the equations of motion satisfying the conditions stated are 
3 = yHm XY, = = yH Xv, (74.4) 
where v is a unit vector in the equilibrium direction of the vector Ly; the trans- 
formation L —>—L implies that v + —v. Here we have used the fact that H, 
and H,,, which are zero in equilibrium, are themselves linear in 1 and m, and 
that v is the only disposable constant vector. By analogy with §69 the coeffi- 
cient y could be written as y = (gle|/2mc)Lo, but unlike the ferromagnetic 
case we now have g = 2 even if relativistic effects are neglected. For mono- 
chromatic oscillations 01/0t = —iol, etc., and the vectors 1 and m defined by 
(74.4) are then perpendicular to v. In the approximation considered, this means 
that the vector L precesses about the direction of v with a constant magnitude 
Le Lo. 

To determine the effective fields H, and H,,, we must establish the form of 
the free energy of the crystal. The terms needed are those of the second order 
in the small quantities 1 and m, or in the case of terms containing derivatives of 
these quantities with respect to the coordinates, those not above the second 
order in the wave number of the oscillations, whose wavelength is assumed large 
compared with the lattice constant (as in §70). In the exchange approximation, 
the free energy must be invariant under simultaneous rotation of all the 
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magnetic moments, and also under a change in the sign of L. An expression 
satisfying all the conditions stated is 


ht i E ð ôm) 1 aa 
Fao = | [za +56 (m. = Lae) tye ae ax af” (74.5) 


where the z-axis can be taken parallel tov, so that a change in the sign of vimplies 
a change in the sign of z. The coefficient a>0, in accordance with the fact that 
m must be zero in equilibrium. The term in I? is absent here, since it would imply 
a dependence of the energy on the direction of the vector L = Lo+1 in the 
crystal which does not occur in the exchange approximation. The term con- 
taining the sum m.ol/0z+1.0m/0z reduces to a total derivative and would 
vanish on integration over the volume. Lastly, the terms quadratic in the deriv- 
atives Om/Ox; need not be considered, since they are certainly small in compar- 
ison with the term in m2, 
Varying the integral (74.5) (and integrating by parts), we obtain 
H, = p zta k’ Hu = —am—b = (74.6) 
For a plane monochromatic spin wave, the equations of motion (74.4) now 
give 
B = —yam Xv—ik,ybl Xv, (74.7) 
—iom = ik,ybm Xv—ya(n) kX, 


where again (as in §70) «(n) = œn, and n is a unit vector in the direction of 
k. Multiplying the first equation vectorially by v gives 


yam = —iolXv—ik.ybl, (74.8) 


and substitution of this in the second equation leads directly to the following 
dispersion relation for spin waves: 


w = yk[ax(n)—b?(v .n)?]"!2. (74.9) 


Thus the frequency of the spin waves, and thus also the magnon energy ¢ = fiw, 
in an antiferromagnet in the exchange approximation are proportional to k, 
not to k? as in a ferromagnet. 

Equations (74.7) establish a one-to-one relation between l and m, but the 
two components of l in the plane perpendicular to v remain arbitrary. This 
means that the spin waves in the antiferromagnet considered have two inde- 
pendent directions of polarization. 


t This dispersion relation for antiferromagnets was first derived by L. Hulthén (1936). The 
derivation using the macroscopic treatment of the magnetization of the sub-lattices was given 
by M. I. Kaganov and V. M. Tsukernik (1958). 
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To take account of magnetic anisotropy, we must make more specific 
assumptions as to the symmetry of the crystal. Let this be uniaxial, and let 
the equilibrium direction of L be parallel to the axis of symmetry.’ 

It is seen from (74.8) that the vector m in the spin wave is small compared 
with 1, containing an extra power of the small wave vector k. In the same way, 
the effective field H,, >> H,. For this reason, it is sufficient to consider the an- 
isotropy due to the vector 1. With the assumptions made, the density of this 


energy is 
Uan = 4 KP, (74.10) 


where K > 0. When it is taken into account, there is a further term — K] in 
the effective field H,, which for a plane wave is then 


Hy, = ik,bm—[a(n)k2+K]]}. (74.11) 


Hence we find that, with allowance for the anisotropy, the spin wave disper- 
sion relation is obtained from (74.9) by adding K to «k?. The result is that, 
when k — 0, the magnon energy tends not to zero, but to the finite value 


e(0) = yy (aK) (74.12) 


(C. Kittel 1951). The frequency œ(0) = «(0)/A is called the antiferromagnetic 
resonance frequency. It should be noted that the gap in the spectrum is propor- 
tional to the square root of the anisotropy constant, not to this constant itself 
as in (70.12). Since the smallness of the relativistic effects is expressed by the 
relative smallness of the anisotropy constant, we see that these effects are in 
general more important in an antiferromagnet than in a ferromagnet. 

The magnon contribution to the internal energy of an antiferromagnet is 
calculated from (71.3). In the temperature range e(0) «< T « Ty (Ty being the 
Néel point, the temperature at which the antiferromagnetism disappears), 
we can use the spectrum (74.9). In a uniaxial crystal, 


o = palate) ae}, 0g = as— bja, 


The calculation of the integral (71.3) gives the following result for the magnon 
contribution to the specific heat: 


47°T8 


C mag 
At temperatures T< e(0), on the other hand, the magnon contribution to 
the thermodynamic quantities is exponentially small. 


t This type includes the antiferromagnet FeCO 3, with a rhombohedral lattice (crystal 
Class D34) and two iron ions in the unit cell. The magnetic moments of these ions are in oppo- 
site directions along the third-order axis of symmetry (the z-azis). 


CHAPTER VIII 


ELECTROMAGNETIC FLUCTUATIONS 


§ 75. Green’s function of a photon in a medium 


TURNING now to study the statistical properties of an electromagnetic field in 
material media, let us first recall the significance of the averagings of electro- 
magnetic quantities in macroscopic electrodynamics. 

If we start, for ease of visualization, from the classical standpoint, we can 
distinguish the averaging over a physically infinitesimal volume with a given 
arrangement of all the particles in it, followed by averaging the result with 
respect to the motion of the particles. Maxwell’s equations of macroscopic 
electrodynamics involve fully averaged quantities. In considering field fluctua- 
tions, however, we are concerned with the oscillations in time of quantities 
averaged only over physically infinitesimal volumes. 

From the quantum-mechanical standpoint, we can of course speak of aver- 
aging over a volume only for the operator of a physical quantity, not for the 
quantity itself; the second step is to find the mean value of this operator by the 
use of quantum-mechanical probabilities. The operators that occur in this 
chapter will be understood to be averaged only in the first sense. 

The statistical properties of electromagnetic radiation in a material medium 
are described by the Green’s function of a photon in the medium. For photons, 
the y operators are replaced by the operators of the electromagnetic field po- 
tentials. The photon Green’s functions are defined in terms of these operators 
in the same way as they are for particles in terms of the y operators. 

The field potentials form a 4-vector A“ = (Æ, A), where Æ? = ¢ is the scalar 
potential and A the vector potential. The choice of these potentials is not 
unique in classical electrodynamics: they allow a gauge transformation, which 
has no effect on any observable quantities (see Fields, §18). Correspondingly, 
in quantum electrodynamics there is a similar non-uniqueness in the choice 
of the field operators, and therefore in the definition of the Green’s functions 
of the photon. We shall use the gauge in which the scalar potential is zero: 


A = o=0, (75.1) 


so that the field is determined by the vector potential only. This gauge is usually 
convenient for problems involving the interaction of the electromagnetic 
field with non-relativistic particles, as in the case of a field in ordinary material 
media. 
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In this gauge, the Green’s function is a three-dimensional tensor of rank two, 
DilX1, X2) = KTÂ(X1) (Xo), (75.2) 


where i, k = x, y,z are three-dimensional vector suffixes and the angle brackets 
denote (as in (36.1)) averaging over the Gibbs distribution for a system consist- 
ing of the medium and the radiation present in it in equilibrium; since the 
photons are bosons, there is no change in the sign of the product when the 
operators 4,, A, are interchanged by the chronological operator. Moreover, 
the operators A, are self-conjugate (since the photon is a strictly neutral par- 
ticle); no distinction is therefore made between A, and Â} in (75.2).* 

The basic concept in the construction of all photon Green’s functions must, 

however, be not (75.2) but the retarded Green’s function, defined by 
iDÈ (Xi, X2) = ie 1) A(X2)— Âk( X2) A(X), t > tz, (75.3) 
0 ti < l2; 
the minus sign between the two terms in the angle brackets corresponds to the 
definition (36.19) for Bose statistics. 

For a closed system, the Green’s function depends on the times ft, and tz 
only through their difference ¢ = tı— tə. The coordinates rı and re, in the general 
case of an inhomogeneous medium, appear in the Green’s function independ- 
ently: DR(t; rı, r2). Accordingly, the Fourier expansion will be made only 
with respect to time, the component being 


DR(w; r1, r2) = f et DR (t; ry, ro) dt. (75.4) 
0 


By considering quantities averaged over physically infinitesimal volumes, we 
restrict ourselves to considering only the long-wavelength part of the radiation, 
in which the photon wave numbers satisfy the condition 


ka< 1, (75.5) 


where a denotes the interatomic distances in the medium. In this frequency 
range, the photon Green’s function can be expressed in terms of other macro- 
scopic characteristics of the medium, the permittivity e(w) and the permeabil- 


ity u(o). 
To do so, we write the operator of the interaction of the electromagnetic 
field with the medium, 
peu [j.Ady, (75.6) 


~ c 
ed 


t In the general case of an arbitrary gauge of the potentials, the photon Green’s function 
is a 4-tensor D; in the gauge (75.1), Doo = 0, Dos = 0. The general tensor and gauge proper- 
ties of the photon Green’s function in statistical physics are the same as in quantum electro- 
dynamics of a field in the vacuum. The definition (75.2) differs in sign from that used in ROT. 
It is chosen here so as to correspond to the definition of the Green’s functions of other bosons, 
including phonons. 
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where j is the electric current density operator due to the particles of the me- 
dium." If a classical “external” current j(t, r) is put into the medium, there is a 
corresponding interaction operator 


=-1 f j(t, 1). Â dx. (75.7) 


This expression enables us to establish a connection with the general theory of 
the response of a macroscopic system to an external interaction. 

In this theory (see Part 1, §125) there appears a discrete set of quantities 
x, (a = 1, 2, ...) which describe the behaviour of the system under certain 
external interactions. These interactions are described by “perturbing forces” 
F,( such that the interaction energy operator has the form 


Ps— aa 


where £, are the operators of the quantities x,. The mean values x,(t) brought 
about by the perturbation are linear functionals of the forces f(t). For the 
Fourier components of the quantities, this relation is written 


Xas = F %a6() fow 
b 


(we assume that x, = 0 in the absence of the perturbation). The coefficients 
x In these relations are called the generalized susceptibilities of the system. 
If the two quantities x, and x, behave in the same way under time reversal, 
and the body is not magnetoactive (has no magnetic structure and is not in 
a magnetic field), the «,, are symmetrical with respect to their suffixes. 

Here we are concerned with quantities f, and x, that are functions of the 
coordinates r of a point in the body and thus have distributions. In this case, 
the expression for V is to be written 


P=-Y | falt, £) 2(t, r) Bx, (75.8) 


and the relation between the mean values x, and the forces f, is 
Xaw(K) = È f aalo; r, r') fool’) dx’, (75.9) 
b 


The generalized susceptibilities now become functions of the coordinates of 
two points in the body, and their symmetry is expressed by 


XablO; E, ¥’) = Abal; r’, r). (75.10) 


t See RQT,§53 (where the current is denoted by ej, i.e. the unit charge e is excluded from 
the definition of j). The operator (75.6) assumes the use of the relativistic expression for the 
current operator. In non-relativistic problems, we can neglect in the y operators (from which 
the current operator j is constructed) the parts due to negative frequencies, i.e. to antiparticles. 
This means, in particular, neglecting the radiative corrections that alter the photon Green’s 
function in the vacuum because of the virtual electron—positron pairs. These corrections are 
negligible at wavelengths A >> //mc, a Condition that is certainly satisfied in the range (75.5). 
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According to Kubo’s formula (see Part 1, (126.9)), the susceptibilities can be 
expressed in terms of the mean values of the commutators of the Heisenberg 
operators X(t, r): 


Xablw; r, r’) = zl eiet. Z(t, r) 2a(0,r')— (0, r’) £.(t, r)) dt. (75.11) 


We shall now regard the components of the current vector j as the “forces” 
Ja Then it is seen by comparing (75.7) and (75.8) that the corresponding quan- 
tities x, are the components of the vector potential A/c of the field. A compari- 
son of (75.11) with the definitions (75.3), (75.4) shows that the generalized 
susceptibilities «,,(w; r, r’) are the same as the components of the tensor 
— DR (w; r, r’)/fc?. By (75.10), this immediately gives (for non-magnetoactive 
media) 

DE; r,r) = DE(w;r’, r). (75.12) 


The relations (75.9) become 


A;.(r) = — ic | DR(w35r, r^) jor’) Px’. (75.13) 


The mean value A is just the vector potential of the macroscopic (fully aver- 
aged; see the beginning of this section) electromagnetic field in the medium; 
henceforward the bar will be omitted from A and the other macroscopic quan- 
tities. We now use the fact that the macroscopic field due to the classical current 
j satisfies Maxwell’s equation 


aima pe 
C Cc 


where D is the electric induction; in the general case of an anisotropic medium, 
D, is related to the field E,, by D,,, = &(@)E;,,,; if the medium is inhomogeneous, 
the permittivity tensor is also a function of the coordinates, ¢,(, r). 

In the gauge chosen for the potentials (75.1) we have 


B, = curl A,, E, = iwA,/c, (75.14) 


where B is the magnetic induction, related to the field H by B,, = a: Pl 
The potential therefore satisfies the equation* 


[curl in (han curl,,,)— (w?/ c?) Eik] Akw = 4rtjiol C. 


t In macroscopic electrodynamics the mean value of the microscopic electric field is usually 
denoted by E, and the mean value of the magnetic field by B, called the magnetic induction. 

+ Here and henceforward we use the notation curl, = eg0/Ox,, where em is the unit 
antisymmetrical pseudo-tensor; (curl A), = curl,A;. 
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Substituting A,,in the form (75.13), we find that the function DÈ must satisfy 
the equation 


[curl (1 mn CUrl,1)— (@?/c?) ey] DRC; r, r’) = — 40hd;,6(r—r’). (75.15) 


This equation is considerably simpler for media that are isotropic (in each 
volume element), where the tensors £ and u reduce to scalars. The permeabil- 
ity is usually close to unity, and we shall take u = 1 in the rest of this section. 
Putting €p = Eô and wu, = Ôp, we get the equation 


2 
EZT ÖA — Oy S elw; pl DR (w; r, r') = —4ohb,8(r—r’). (75.16) 

Thus the calculation of the retarded Green’s function for an inhomogenous 
medium amounts to the solution of a certain differential equation (I. E. Dzya- 
loshinskii and L. P. Pitaevskii 1959).* 

At the interfaces between different media, the components of the tensor DÈ 
must satisfy certain conditions. In equation (75.16), the second variable r’ and 
the second suffix k are not involved in the differential and algebraic operations 
on the tensor DŽ, i.e. they act only as parameters. Hence the boundary condi- 
tions are to be imposed only with respect to the coordinates r for the function 
DR(w; r, r’), regarded as a vector with the suffix /. These conditions correspond 
to the requirement that the tangential components of E and H be continuous, as 
given by macroscopic electrodynamics.* Since E = —A/c, the role of the vector 
E is here played by the derivative —(1/c) ODR(t; r, r’)/Ot or, in Fourier com- 
ponents, 

i(w/c)DR(w; r, r’). (75.17) 


Similarly, the role of the vector H (which is the same as B when u= 1) is played 
by | 
curl; Diw; r, r’). (75.18) 


For a spatially homogenous infinite medium, the function DÈ depends only 
on the difference r—r’. For the components of the Fourier expansion with re- 
spect to this difference, the differential equation (75.16) becomes a set of alge- 
braic equations 


i| w2 
Ach Mik bk? + ôa = e(o)| DR(w, k) = ôk. (75.19) 


t The function DZ is the Green’s function of Maxwell’s equations in the sense familiar in 
mathematical physics, namely the solution of the field equations with a point source, satis- 
fying the retardation condition. The advanced function D4 satisfies the same equation with 
€* instead of £. 

+ The boundary conditions for the normal components of B and D give nothing new in this 
case, because in a field that varies with time as e~™! the equations div D = 0 and div B = 0 
are implied by the equations curl E = ‘wB/c and curl H = —iwD/c. 
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Their solution is 


Anh c*k ik, 
Dio, k) = w*e(@)/e— k? [ou seca | 9 (75.20) 


According to (36.21), the Green’s function D,, for a homogeneous medium 
is expressed in terms of the retarded function Df. by 


Dum, k) = re DE(w, k)+i coth (a/2T) im DR(m,k). (15.21) 


AsT — 0, this becomes 
Dio, k) = re DR(w, k)+i sgn w-im DR(w, k). (75.22) 


The function DÈ is given by (75.20); since re e(w) is an even function of w, and 
im (w) an odd function, we find at T = 0 


In a vacuum, (w) = 1. Since in any material medium im e(w) > 0 when 
œw > 0, the vacuum corresponds to the limit e ~ 1+i0. The resulting expres- 
sion is 


Anh cekk 
DP(w, k) = TEk i0 (èu) , 


in agreement with the known result in quantum electrodynamics (see RQT, 


§77). 


§ 76. Electromagnetic field fluctuations 


As already mentioned at the beginning of §75, in the treatment of electromag- 
netic field fluctuations we are concerned with the oscillations in time of quan- 
tities averaged only over physically infinitesimal volume elements (not over the 
motion of particles within them). The quantum-mechanical operators of these 
quantities are to be taken in the same sense. 

The basic formulae of the theory of electromagnetic fluctuations can be 
written down directly from the general formulae of the fluctuation—dissipation 
theorem (Part 1, §125). For a discrete set of fluctuating quantities x, the spec- 
tral distribution of the fluctuations is expressed in terms of the generalized sus- 
ceptibilities «,,(w) by 


(XaXb)o = Zth(cfa—eep) coth (fw/2T), 


where (x,x;),, is a component of the Fourier expansion with respect to time of 
the correlation function 


Padl!) = % a(t) ACO) + CO) Falt), 


and £,(t) are the Heisenberg operators of the quantities x,. For distributed 
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quantities x,(r) (functions of the coordinates of the point in the body), this for- 
mula becomes 


(xOxP),, = Fih coth (heo/2T) lata; re, r1)—%ap(@3 ri ¥2)], (76.1) 


where the superscripts (1) and (2) denote that the values are taken at the points 
rı andro. 

In §75 it has been shown that, if the quantities x, are the components of the 
vector potential A(r)/c, the corresponding generalized susceptibilities are the 
components of the tensor — DR (w; r1, r2)/fc?. We therefore find immediately 


(APAP),, = Zi coth (hw/2T){DR(w; r1, r2)—[DE(@; r2, r1)]*}. (76.2) 


The spectral functions of the field fluctuations are easily found from (76.2). 
Let $4.(t1, r1; tə, ra) be the correlation function of the fluctuations of the vector 
potential; the expression (76.2) is the component of the Fourier expansion of 
this function with respect to t = t1— tz. Since the electric field E = —A/c, the 
corresponding function for the components of E is 


pik = @ Ot Ota Pik — <a ae Pls 
or, in Fourier components, 
(EPEP)y = (w8/c2)(APAP)., (76.3) 
Similarly, since B = curl A, we have 
(BP BP), = curl{P curl®, (4A)... (76.4) 
(EP BP). = (iw/c) curl H A A®),,. (76.5) 


By expressing the correlation functions of the electromagnetic fluctuations in 
terms of the retarded Green’s function, formulae (76.2)-(76.5) reduce their 
calculation to the solution of the differential equation (75.15) or (75.16) wth 
the appropriate boundary conditions at the specified interfaces." 

We shall suppose henceforth that the medium is not magnetoactive. Then the 
function DÈ has the symmetry property (75.12), and (76.2) becomes 


(APAP) = —coth (iw/2T) im DR(w; r1, r2). (76.6) 


It should be noted that the expression (76.6) is real, and therefore so are 
(76.3) and (76.4), while (76.5) is imaginary. This means that the time correla- 
tion functions between components of E, and between those of B, are even func- 
tions of the time t = tı— tg (as they should be for a correlation between quanti- 
ties that are both even or both odd under time reversal). But the time correla- 
tion function of the components of E with those of B is an odd function of the 


t The theory of electromagnetic fluctuations was developed in another form by S. M. Rytov 
(1953), and in a form equivalent to (76.2)-(76.5) by M. L. Levin and S. M. Rytov (1967). 
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time (as it should be for two quantities of which one is even and the other odd 
under time reversal). Hence it follows that the values of E and B at any one in- 
stant are not correlated (an odd function of t is zero when t = 0). Together 
with the correlation function, the mean values of any expressions bilinear in E 
and B (taken at the same instant), for example the Poynting vector, are zero. 
This is in fact obvious, since, in a body in thermal equilibrium and invariant 
under time reversal, there cannot be any internal macroscopic energy fluxes. 


§ 77. Electromagnetic fluctuations in an infinite medium 


In a homogeneous infinite medium, the functions DÈ (œ; r4, r2) depend only 
on the difference r = rı—rz and are even functions of r; equation (75.15) con- 
tains only second derivatives with respect to the coordinates, so that DR(w; r) 
and Di(w; —r) satisfy the same equation. Taking the Fourier components 
with respect to r of both sides of (76.2), we find 


(APAP) x = zi coth (hw/2T){Di(w, k)—[DE(@, k)]"}. (77.1) 
For non-magnetoactive media, with (75.12), this becomes 
(APAP).~ = —coth (fw/2T) im DR, k). (77.2) 


In an isotropic non-magnetic (u = 1) medium, the function D2(w, k) is given 
by (75.20). The problem of finding the spatial correlation function of the fluc- 
tuations reduces to the calculation of the integral 


Di(w; r) = f| Di(w, k) e&-* d3k/(2n). (77.3) 
The integration is effected by means of the formulae 
eik.r dk err 


e+ Ona dar’ 


kikket dk — @ e 
| e T T Dð Sar’ ORD 


the first of which is obtained by taking Fourier components of the known 
equation 


(a—#) a = —4n8(n), (77.5) 





and the second by differentiating the first. The result is 
e æ Í or 
we Ox:Ox, at ew (—“Tv— e) (79 


Dior) =—h | Ox - 


where r = |ra—rı| and y —e is to be taken with the sign that makes 
re 4/ — € > 0; for a vacuum we must put £ = 1, y —e = —i (see below). 
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Hence, using (76.6) and (76.3), we have immediately 


Ew? 


1 ð? 1 
(EMEP). = h coth -pim | & a Oat Beam | 7 oP (— v-e) 
(11.7) 


(S. M. Rytov 1953). Contracting this expression with respect to the suffixes i and 
k, and using formula (77.5), we obtain 


2 
(ED. E®), = 2 coth ~ im le Se exp ( Sy e) +2n009|| . (778) 
Similarly, a calculation from (76.4) gives expressions for the correlation func- 
tions of the magnetic field which differ from (77.7) and (77.8) by the absence 
of the factor 1/e in front of the square bracket; the delta-function term in (77.8) 
then has no imaginary part and does not appear in the result. 

The occurrence of the imaginary part of ¢ in the expressions (77.7) and (77.8) 
shows clearly the relation between the electromagnetic fluctuations and the 
absorption in the medium. But if we take the limit im € — 0 in these expressions, 
we obtain non-zero results. This is connected with the order in which two limits 
are taken, those of an infinite medium and zero im e. Since in an infinite me- 
dium an arbitrarily small im e eventually gives rise to absorption, with our order 
of taking the limits the result pertains to a physically transparent medium in 
which, as in any actual medium, there is still some absorption. 

For example, let us take these limits in (77.8). To do so, we note that for a 
small positive im e(withw > 0) 


Vn-e x—iyree. (143 De ) 





2ree 


(using the condition that re y/—e€ > 0). Hence, in the limit as im € —0, we ob- 
tain 
2w?h conr hw 


2, Sin —— coth Or’ (77.9) 





(EW. ED), = an (H.H), = 
n 


where n = ye is the real refractive index. Because the delta-function term is not 
present, this expression remains finite even when the points r, and re coincide: 


3h ñ 
a “5 z coth =. (77.10) 





(Eo = -7 Z E = = 


The passage to the limit of a transparent medium could also be made at an 
earlier point in the calculation, namely in the Green’s function. Since the sign 
of im e(w) is the same as that of w, we find that in this limit the function (75.20) 
becomes 


Di(o, k) = 


Anh Ckikt 
| nS | f (77.11) 


œn? |e- k? +i0.sgn w wn? 
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The imaginary part of this function depends only on the way in which the poles 
w = +ck/n are avoided; separating it by means of (8.11) and substituting in 
(77.2), we obtain 


2mh /w? kikk nw hw 
(EPE) = T a Ôik — e)l af T (P44) coth FF 


(77.12) 


The arguments of the delta functions in this expression have a simple physical 
significance: they show that the field fluctuations with a given value of k are 
propagated in space with velocity c/n, equal to that of propagation of electro- 
magnetic waves in the same medium. The inverse Fourier transformation of 
(77.12) leads us back to (77.7), of course. 

The energy of the fluctuation electromagnetic field in a transparent medium 
(with u = 1)in the spectrum range dw is, per unit volume of space, 


Aos) 








a 


zX Ee tAE 


see ECM, §61.* Substitution of (77.10) readily gives 


o | wn? d(nw) 
“AoT f we As dw. (77. I 3) 





fi 


The first term in the square brackets is due to the zero-point oscillations of the 
field; the second term gives the energy of electromagnetic radiation in thermo- 
dynamic equilibrium in a transparent medium, i.e. the energy of black-body 
radiation. This part of the formula could also have been obtained without con- 
sidering fluctuations, by means of a generalization of Planck’s formula for 
black-body radiation in a vacuum. According to this formula, the energy of 
black-body radiation per unit volume in the wave vector range dk is 


ha 2d°k 
eto] * (27) ’ 


the factor 2 takes account of the two directions of polarization. Correspondingly 
to obtain the spectral energy density we must replace d3k by 4ak2dk and sub- 
stitute k = w/c. To change from a vacuum to a transparent medium, it is 
sufficient to write k = nw/c, i.e. 


w?n? d(nw) 


2 = p2 = 
k? dk = K(dk/dw) do = S A 


dw 








which gives the required result. 


t The total energy is found by integrating with respect to w from 0 to co; the factors of 
2 in the square brackets arise because, by our definition of the spectral functions of the fluctua- 
tions, the mean value (x*) is obtained by integrating (x), with respect to w/27 from — œœ to ce 
(see Part 1, (122.6)). 
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PROBLEMS 


PROBLEM 1. Find the electromagnetic field fluctuations at a large distance from a body 
embedded in a transparent rarefied medium with which it is thermal equilibrium; the wave- 
length of the radiation and the distance from the body to the point considered are large com- 
pared with the size of the body. The body has an anisotropic electric polarizability «,(w). 


SoLuTION. The rarefied transparent medium is regarded as a vacuum. The required fluctua- 
tions are determined by the change in the vacuum Green’s function due to the presence of the 
body; this change is small (at large distances). To calculate the change, we start from an 
analogy in which the vacuum function D} (w; r, r’) (for a given suffix k) may be formally 
regarded as the electric field E,(r, r’) at the point r due to a source at the point r’. This analogy 
is based on the fact that the field E,(r, r’), like its potential A,(r, r’), satisfies for r = r’ a similar 
equation to D(a; r, r’), namely (75.16) with € = 1. Let the body be at the point r = 0. The 


field 
E,(0, r | = Dio; 0, r ') = Do; r’), 


where DE(w; r) is the Green’s function in the vacuum in the absence of the body (given by 
(77.6) with € = 1), polarizes the body and thus creates at the point r = 0 a dipole moment 
di = %,DE(w; 0, r’). The field created in turn by this dipole moment at the point r gives the 
required change 6DE(w; r, r’). According to a formula in electrodynamics (see Fields, §72), the 
field created at the point r by a dipole moment d (varying with time as e~**) is 

w? oO? erle 

E Pa rE, cae ees 
Tdi | g lut Ox, Ox; : 

where the distance r has to be large only in comparison with the size of the body, not with the 
wavelength. This expression may be written 


w? 
E = Fer Di(w;r) d, 
(the function DZ; r) is even inr). With the dipole moment as given above, we therefore 
have 
ODE(w; r, r) = — (w?/fic?) DECO; 1) tim Dilo; r’). 


The required correlation functions of the fiuctuations are now found from the general formulae 
(76.3)-(76.6) with ÔD}, instead of D}. The final result is 


2 
APAP = oe {5+ ae} m (DEO; r) anD: rD] a) 


The body is at r = 0; r, and r, are two points remote from it. There is a contribution to the 
fluctuations not only from the imaginary part of the polarizability but also from its real part; 
the latter contribution may be regarded as the result of scattering by the body of black-body 
radiation occupying the transparent medium. 


PROBLEM 2. The same as Problem 1, but for a body with magnetic polarizability ag(w).t 


SoLvuTION. In this case we regard curl, D%(w;r,r’)as the magnetic field H,(r, r’) created at 
the point r by a source at the point r’; an equation of the same form as for the function DẸ is 
satisfied not by the field H, but by its potential A,. This field magnetizes the body, creating 


at r = 0a magnetic moment 
M, = — Xq curl, Dlo; 0, r’); 


t The presence of a magnetic polarizability does not necessarily mean that the body con- 
sists of a magnetic material; for example, we may be considering the displacement of the mag- 
netic field from the body because of the skin effect. 
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the differentiation with respect to r is replaced by one with respect to r’, using the fact that 


DE, depends only on the difference r—r’. The required change in the Green’s function is equal 
to the vector potential of the magnetic field created by this magnetic moment at the point rt 


A, = curly [> m ctor] ; 


see Fields, §72, Problem 1. Thus 
ODiw;r, r’) = — (curta = 


rje 





) Sim Curling DE(o; 0, r’). 


Lastly, substituting DZ, from (77.6), we find 
efor’ Ie 
r 








EDR; r, r) = A (curly = ) tim cur (2 


r 


using the fact that curl, Va = Emm Vi Vn = 0. 


PROBLEM 3. Determine the fluctuations of the electromagnetic field in the conditions of 
Problem 1, but assuming that the temperature of the medium is much less than that of the body. 


SOLUTION. The field calculated in Problem 1 separates naturally, according to the two terms. 
in the braces in (1), into zero-point fluctuations and thermal black-body radiation. The latter 
in turn consists of two parts; the thermal radiation of the body itself, and the field resulting. 
from scattering of black-body radiation by the body. If the temperature of the medium is low, 
the second part does not appear. In solving the problem, we calculate that part separately and 
then subtract it from (1). Let Af) = AW +A”, where AW is the fluctuational field in the 
absence of the body, and A the field scattered by the body. At large distances, where A is 
small, we can neglect the terms quadratic in A“ when calculating 6(A;1Az2). The scattering 
contribution is therefore 


8A Arson © (APAY) 0+ (AVAL) = (APAR )o + (ABAP)S. 


The scattered field is again given by the formula in Fields, §72, but the dipole moment is now 
to be taken simply as that induced by the black-body radiation, d; = &_4°(0). Again using 
the Green’s function in the vacuum in the absence of the body, we have 


2 
APD = — Fz DAO; T1) aml) AWO), 


so that 
2 
(ARADo =- -FT DEO; T1) im AIO) APCs) ar 


The correlation function (4194$), is again taken from (76.2). Since we are interested only 
in the thermal radiation, the zero-point oscillations in this formula are to be omitted, with the 


change 
Epp Oh ae 
2 Oe oT ~ eer y 2 eo 


The result for the contribution of scattered black-body radiation to the correlation function is 


2 
DN Anda = gay WHO: ED dm im Dio; r) + DEO; r) ah, im DE; eI} 


(3) 
Lastly, in order to find the fluctuational feld in a cold medium, we must subtract (3) from (1). 
A simple rearrangement using the symmetry of the tensors Dy and «,, gives 
2%? 


OP (AgAze)a = ECOLES DROW; rli Arn (0)] DZE; ts), (4) 
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where T is the temperature of the body. Only the thermal term has been written here: the zero- 
point oscillation term in (1) remains unchanged. It should be noted that the expression (4) for 
the thermal radiation of the body depends only on the imaginary part of the polarizability. 
The energy flux calculated from (4) is not zero; it gives the intensity of thermal radiation from 
the heated body into the surrounding cold medium. 


§ 78. Current fluctuations in linear circuits 


Another interesting application of the fluctuation-dissipation theorem is the 
problem of current fluctuations in linear circuits, first discussed by H. Nyquist 
(1928). 

The current fluctuations are free electrical oscillations in the conductor (i.e. 
they occur in the absence of any externally applied e.m-f.). In a closed linear 
circuit the oscillations of greatest interest are, of course, those in which a non- 
zero total current J flows in the conductor. In what follows we shall assume 
that the condition for a quasi-steady state holds: the dimensions of the circuit 
are small compared with the wavelength 2 ~ c/w. Then the total current J is 
the same at every point in the circuit, and is a function of time only. 

This current J may be taken as the quantity x(t) which appears in the general 
formulation of the fluctuation-dissipation theorem (Part 1, §124). In order to 
ascertain the meaning of the corresponding generalized susceptibility «, let us 
suppose that an external e.m.f. & acts in the circuit. Then the rate of dissipation 
of energy in the circuitisQ = J£. A comparison with the expression Q = — xf, 
which serves to define the “force” f (see Part 1, (123.10)), shows that f = —2, 
or in Fourier components &,, = iwf,. On the other hand, the current and the 
e.m.f. in a linear circuit are related by ¢,, = Z(w) J,,, where Z(w) is the im- 
pedance of the circuit. We therefore have 


Jo = lZ = ivof/Z, 


and comparison with the definition of the generalized susceptibility in the rela- 
tion (¥), = «(w)f gives «(w) = iw/Z(w). The imaginary part of « is 


ima = im (iw/Z) = (w/| Z|?) R(w), 
where R = re Z. 
According to the fluctuation—dissipation theorem, 


(xX) = Á coth (fw/2T).im (w), 
we now find as the spectral function of the current fluctuations 
(J)o = [Aw]! Z(@) |?] R(@) coth (Aw/2T). (78.1) 


This formula can be put in another form by regarding the current fluctuations 
as resulting from the action of the “random” e.m.f. £, = ZJ,,. This gives 


(22), = hwR() coth (fio/27). (78.2) 
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In the classical case (iw « T). 
(Co = 2TR(w). (78.3) 


We must again emphasize that these formulae are entirely independent of the 
nature of the phenomena responsible for the dispersion of the circuit resistance. 


§ 79. Temperature Green’s function of a photon in a medium 


The temperature Green’s function of a photon in a medium is constructed 
from the Matsubara operators of the electromagnetic field potentials in the 
same way as the time Green’s function (75.2) is constructed from the Heisen- 


berg operators: : i 
Din = — (T, AMT, r1) AZ (To, r2)). (79.1) 


Here we have used the fact that, since the Schrödinger operators of the field 
are Hermitian, the Matsubara operators Â™ and A™ (defined as in (37.1)) are the 
same. These operators, however, unlike the Heisenberg ones, are not them- 
selves Hermitian : since the parameter 7 is real, we have 


[ AM, r) Toa [exBrin A(r) e-i'in] + p-ht'in A(r) eri’ It, 
or : 
[ÂM(r, r)]+ = ÂM(—r, r). 
Since the function (79.1) depends only on the difference t = Tı — Tə (cf. §37), 
we can write (taking, for example, t > 0) 
Dit; r1, r2) = —(AM(r, r,) A¥ (O, ra), 

Di —T; Ti, r2) = —(AM(r, To) AM(0, r)). 

A comparison of these two expressions shows that 


Di —T; Fis r2) = Dit; Yo, rı). (79.2) 


The function D,, can be expanded in a Fourier series in the variable 7: 


Di fT; 01, r2) = T y Dir(ls3 ri, r2) e787, (79.3) 


S = ~ 


the “frequencies” ¢, taking values such that AC, = 2xsT (because the photons 
obey Bose statistics; see (37.8)). For the components of this expansion, (79.2) 
gives the corresponding relation 


Diks; ri, r2) = Dki(— Ñs; r2, r1). (79.4) 


According to the general relation (37.12), these components are related to the 
retarded Green’s function by 


Diks; ri, r2) = DRC, ; re, r1) 
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for positive ¢,. It has been shown in §75 that the functions DÈ (w; 1, r2) may 
in a sense be regarded as generalized susceptibilities occurring in the general 
theory of the response of a macroscopic system to an external interaction. Hence 
there followed the symmetry property of these functions expressed (for non- 
magnetoactive media) by equation (75.12); because of the relation between 
Di and Dp the latter have a similar property, 


Dix(Os3 ris r2) = Dei Cs3 r2, r1). (79.5) 


From this equation together with (79.4), it now follows that the functions 
D (0,381) r2) are even functions of the discrete variable ¢,, so that for all its 
values (positive and negative) we have 


Dios; Ti, r2) m= D3} K(i les |; ri, r2). (79.6) 


Furthermore, the function DÈ (œ; rı, re), like any generalized susceptibility, is 
real on the positive imaginary w-axis (see Part 1, §123); it therefore follows from 
(79.6) that the function D;,(¢,; r1, r2) is real for all values of ¢,. Lastly, it follows 
from these properties that the original function D (T; rı, r2) is real and an even 


function of t: 
(DiklT; ti, r2) = Dix(—T; va, ro). (79.7) 


The relation (79.6) between the temperature Green’s function and the re- 
tarded Green’s function enables us to write down immediately the differential 
equation that must be satisfied by the function D, in an inhomogeneous me- 
dium; to do so, it is sufficient to replace w by i| ¢,| in equation (75.15) or (75.16). 
For example, with an isotropic non-magnetoactive medium having p = 1, 
we find the equation 


2 
Es — 6 A+—5 gs E(t beels r) ôu Dik os r, r’) = — 47chb,6(r — r’). (79.8) 


For a homogeneous infinite medium, the function D,(¢,; r, r’) is expanded as 
a Fourier integral with respect to the difference r—r’. The components of this 
expansion satisfy the algebraic equations 


ay [ek bk? — ve e(i | Diks, k) = Sir, (79.9) 
and are given byt 
oo Asch kik, 
Dil, k) = ~ Pei Cs EFE (ut mage D ED | ; (79.10) 


Since the function D,(¢,,k) is expressed (in the long-wavelength range 
ka « 1) in terms of e(w), the diagram technique for calculating it becomes a 


t In practical applications (cf. §80), the function D, always occurs as a product with C2, 
and this removes the divergence at, = 0. 
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technique for calculating the permittivity of the medium. The latter quantity 
also has a definite diagram significance, which will now be elucidated. 

We shall represent the exact D function by a thick broken line, and the va- 
cuum function D by a thin one:* 


— am oe ws = J), omen cue cue an SEE -D,' (79.11) 
The whole set of diagrams representing the 2 function can be expressed as a 
series exactly analogous to the series (14.3) for the function G: 


saree Simcha g E E p E a e 79:12) 


where a circle stands for the set of diagram blocks that do not fall into two parts 
joined only by one broken line; this set will be denoted by — D/4r. The func- 
tion D,, (analogous to the self-energy part of the Green’s function of the par- 
ticles) is called the polarization operator. 

The diagram equation (79.12) is equivalent to 


we eww ew ee IE ee oe ef _ — (m a n ; (79.13) 


cf. the derivation of (14.4) from (14.3). In analytical form, this is 
Dik = DR+ D Dim/ 4r) Dmr (79.14) 


where all the factors are functions of the same arguments ¢, and k. Multiplying 
this equation on the right by the inverse tensor D~! and on the left by [D9], 
we can rewrite it as 

D7, = [DO]; —Dikl4n. (79.15) 


Finally, taking D~',, from the left-hand side of equation (79.9), and a similar 
expression with e = 1 for [D] tp, we find 


Di Ss, k) = (05 /he?)[eE | Cs |)— 1] Sins (79.16) 


which determines the diagram significance of the function e(w)—1 at a discrete 
set of points on the positive imaginary w-axis. The analytical continuation of 
e(i |,|) to the whole of the upper half-plane must, in principle, take account 
of the facts that «(w) cannot have a singularity in this half-plane and that 
elw) > las|w| ~o.* 


t The use of broken lines to denote the D functions cannot cause any misunderstanding 
here, since this and the following section do not explicitly involve the energy of pair interaction 
of the particles in the medium (for which that notation was previously used). 

+ In an anisotropic medium, we must write 


Dila K) = (02 /fic*) [Eli | 0.1) — ôi]. 


In this form, the expression remains valid when there is spatial dispersion and £ depends on 
the wave vector as well as on the frequency. 
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In an inhomogeneous medium, the polarization operator is, like Dy, 2 
function of the coordinates of two points. Repeating the derivation in the coor- 
dinate representation, we obtain instead of (79.14) 


Dit1, r2) = DRE, t2)+ a | O ri, ¥3) Dim(T3, t4) Dmk(Ta, r2) 43x3 d°x4; 


the arguments ¢, are omitted, for brevity. Applying the operator 
o 


a 2 f r2 
0x1, 0x11 nA + (C3/c ) Ont 


to the left of this equation, and noting that D satisfies equation (79.8) with 
e = 1, we obtain 


J Dies, r’) Die’, r2) ax’ = [e(er)— 1183/4) Dix, r2), 


whence 


Dir(Ss3 ri, r2) = (S2/hc?) 6,6(r1— re) [eli] Esl, r1)— 1]. (79.17) 


The structure of a condensed medium, and hence its dielectric properties, 
are determined by the forces acting between its particles at distances of the 
order of the atomic dimensions a. At these distances (if the particle velocities 
are non-relativistic) we can neglect the retardation of the interactions, which 
becomes important only for the long-wave components (ka « 1) of the field. 
That is, in calculating the polarization operator we can neglect the long-wave 
part of the field. In the diagrams for the Green’s function D, itself, however, 
the long-wave field occurs only through the thin broken lines on the right of 
(79.12). 

The three-dimensional tensor D,, considered in this section is, of course, 
only the spatial part of the polarization 4-tensor D. We emphasize, to avoid 
misunderstanding, that its time component Doo and mixed components Po; 
are not zero. Moreover, as in quantum electrodynamics, this 4-tensor is inde- 
pendent of the gauge of the potentials. In non-relativistic theory the gauge in- 
variance is obvious from the possibility just mentioned of calculating the polar- 
ization operator with only the non-retarded forces, which are independent 
of the gauge of the long-wave field. 

The components Poo and Do; can be found from the condition for the 4- 
tensor to be transverse: D,,,k” = 0, where k” = (it, k) is the wave 4-vector: 


Deo = — ADG] 1), (79.18) 
Poi = (iC .k;/fc*) [ei | Cs |)— 1]. 
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§ 80. The van der Waals stress tensor 


Although the structure of condensed bodies is essentially determined (as 
noted at the end of §79) by the forces acting between its particles at atomic 
distances, a definite contribution to the thermodynamic quantities of the body 
(its free energy, say) comes also from the van der Waals forces which act be- 
tween the atoms at distances large compared with the atomic distances a. 
For free atoms, the energy of this interaction decreases with increasing dis- 
tance as r~® (see QM, §89), and as r~? when retardation effects have become 
important (see ROT, §83). In a condensed medium, of course, the van der 
Waals forces do not reduce to an interaction between separate pairs of atoms. 
However, since their range of action is large compared with interatomic dis- 
tances, we can use a macroscopic approach to the problem of their influence 
on the thermodynamic properties of the body. 

In the macroscopic theory, the van der Waals interaction in a material me- 
dium is regarded as brought about through a long-wavelength electromag- 
netic field (E. M. Lifshitz 1954); this concept includes not only thermal fluc- 
tuations but also the zero-point oscillations of the field. An important property 
of the contribution of this interaction to the free energy is that it is not additive; 
it is not simply proportional to the volume of the bodies, but depends also on 
parameters that characterize their shape and configuration. This non-additivity, 
resulting from the long range of the van der Waals forces, is the property that 
distinguishes their contribution to the free energy from the much larger addi- 
tive part. In the macroscopic picture, this property arises from the fact that 
any change in the electrical properties of the medium in some region causes, in 
accordance with Maxwell’s equations, a change in the fluctuational field even 
outside that region. In practice, of course, the non-additivity effects are appre- 
ciable only when the characteristic dimensions are sufficiently small (though 
still large compared with atomic dimensions), e.g. in thin films or in bodies. 
Separated by a narrow gap. 

In the calculation of the contribution of the electromagnetic fluctuations to 
the free energy, the important wavelengths in each case are of the order of the 
characteristic dimensions of the inhomogeneity of the medium (film thickness, 
gap width, etc.). In the macroscopic theory, this is the reason for the power-law 
decrease of the van der Waals forces; if fluctuations with some fixed wave- 
length 2o were important, this would give an exponential decrease, with expo- 
nent ~ rj2o. Since the characteristic dimensions, and therefore the character- 
istic wavelengths of the fluctuations, are much greater than atomic dimensions, 
all properties of these fluctuations and their contribution to the free energy are 
expressed entirely in terms of the complex permittivity of the bodies. 

Our object will be to calculate the macroscopic forces acting in an inhomo- 
geneous medium." As a first step in the derivation, we shall determine the change 


+ The theory given below is due to I. E. Dzyaloshinskii and L. P. Pitaevskii (1959). 
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in the free energy of the medium due to a small change in its permittivity; the 
magnetic properties of the material will be neglected (the permeability u = 1). 
We shall suppose that the change in e is caused by a small change 6A in the 
Hamiltonian of the system. Then the change in the free energy is 


ôF = (8A), (80.1) 


where the averaging is taken (with given temperature and volume of the system) 
over the Gibbs distribution with the unperturbed Hamiltonian Ê. We substitute 
the latter in the form’ 


H=AotVin Vw =- f j-Â dx, (80.2) 


where V’,,, describes the interaction of the particles with the long-wave electro- 
magnetic field, and Ao includes all other interactions together with terms cor- 
responding to free particles and photons. Strictly speaking, the integral in 
(80.2) should be regarded as cut off at a wave number ko « 1/a, but the cut-off 
parameter does not appear in the final result. The operator A is the long-wave 
field vector potential operator; it is important that the operator 6H responsible 
for the change in the permittivity does not contain A, since the permittivity is 
determined solely by the interaction of particles at atomic distances. 

Let us now change in (80.1) to Matsubara operators in what may be called 
the long-wave representation of the interaction: in this representation, the 
dependence of the operators on t is determined by all the terms in the Hamil- 
tonian except V,,. By the same method as in the derivation of (38.7), we 
obtain 


sake Mg 
ôF = On (T: ôĤ Gyo, 


a (80.3) 
ô = T exp | f jM. AM d3x dz,' 

oiy j 

where ¢...)o denotes averaging over the Gibbs distribution with the Hamilto- 


nian A. According to the meaning of the chosen representation, the Matsubara 
operators are defined as 


AM (cz, r) = exp (to) Â(r) exp (— tA), (80.4) 


and similarly for 6A™ and for the y operators from which the particle current 
operator j” is constructed.* Since Ao does not contain the interaction of the 
long-wavelength photons with anything else, A™ is the same as the (Matsubara) 
operator of the free photon field; for the y operators of the particles this is, of 
course, not so, since Ho includes the interaction between particles. 


t In this section we take = 1,c = 1. 
t The suffix O which should also be attached to the operators in this representation is 
omitted to simplify the notation. 
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Following the general construction principles of the diagram technique, we 
expand the exponential in (80.3) in powers of V,,.‘ In each term of the expan- 
sion, the product of the free-field operators A™ is averaged in the usual way as 
pair contractions, using Wick’s theorem. The zero-order term in the expansion, 
which does not contain A™, gives 5Fo, the change in the free energy without 
allowance for the long-wave fluctuations. The next term, linear in A™, gives 
zero on averaging. In the term quadratic with respect to the field, the contrac- 
tion of two operators (MAM) gives DR, the free-photon Green’s function; 
this term may be represented by the diagram 


ge tet. QD (80.5) 


with the numerical factor 1 /2!, which occurs in the expansion of the exponential, 
shown separately. The thin broken line denotes the function OD, and the 
shaded circle denotes the result of averaging all other factors. The explicit 
form of the latter quantity will not be given here; the only important point is 
that it is just 6D,,/4, where 5Dy,, is the change in the polarization operator 
when the Hamiltonian of the system changes by 6. This is easily seen by 
considering similarly the change in the function D. In the same representation 
of the operators, this function is 


1 F A 
DiyAT1, 013 T2, r2) = KON (T; AM (Ta, r1) Aff (T2, r2) ôYo, 


where now 


YT 
é= T, exp Í (—VéM—6H™) dz; 
0 


the “interaction” includes 5H as well as V’,,. The required change 5D, is 
given by the linear term in the expansion of this expression in powers of 6A™; 


ôDik = ie (rf 6 dt. AM(11, r1) AM(t2, r2) exp [pam d?x de \ : 


0 


(80.6) 


In the expansion of the remaining exponential in powers of P, the zero- 
order term is to be omitted, as corresponding to a detached diagram (the 
contraction (AA) is separated from the other factors, which do not contain 
the variables rı and r2). The first-order term contains an odd number of A 
operators and gives Zero on averaging. Lastly, the second-order term gives in 
5D,, an expression represented by the diagram 


D=- -Q (80.7) 


t It is sufficient to show the expansion of the numerator in the expression for ôF. As usual, 
the role of the factor (@)) in the denominator is just to exclude diagrams that separate into 
two or more disconnected parts. 
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with the same circle as in (80.5); the factor 4 does not appear here, because 
there are two ways of contracting the “internal” A operators from the opera- 
tors V,, with the “external” AM and AM. On the other hand, from the definition 
of the polarization operator, the Green’s function in the approximation con- 
sidered is given by the sum 


where the white circle is the polarization operator D,,/42. The variation of 
this function therefore gives the diagram (80.7) with 5D,,/42 as the shaded 
circle. 

All subsequent terms of the expansion in (80.3) are corrections of various 
orders to the broken line and the circle in the diagram (80.5). These corrections 
convert the broken line into the exact function D,,. The long-wave corrections 
to ôD, are small, as already discussed, so that we can immediately take 6D, 
to be the variation of the exact polarization operator. 

In analytical form, this result is written (after changing to the Fourier expan- 
sion with respect to the variable z)* 


§ = —oo 


p- e 1 
ôF = Fo — -5 eae 4 | Dus F1, T2) ra ôPDrkils; r2, r1) d?x1 d3x2. (80.8) 


According to (79.17), the change in the polarization operator is expressed 
(for an isotropic medium) in terms of the change in the permittivity : 


ôDkilÈs; ri, r2) = C3ôkiÂ(rı— ro)ôe(i | Čsl, r1); 


the delta function here eliminates one of the integrations in (80.8). Taking 
account also of the fact that D,, is an even function of ¢,, we can rewrite 
(80.8) as 


ôF = bFo— 2 [ CDu; r, r) ĉefi | Cs l, 4) d3x, (80.9) 
sany 


where the summation is taken only over positive s; the prime denotes that the 
term with s = 0 has an extra factor +. This term is finite, since the factor C* 
cancels the divergence of Dy at ¢, = 0. 

For the writing of further formulae, it will be convenient to introduce two 
further functions: 


Diets; r,r) = —O2D (Es; r, r’), l (80.10) 


DEC; r, r') = curly curlkm Dim(Ẹs; r, r’), 


t We shall not give the general rule for determining the sign of diagrams of the type (80.5) 
(without free external lines). In the present case, the sign is easily found by writing explicitly 
the corresponding terms in the expansions in (80.3) and (80.6). It is indeed sufficient to note 
that this term in (80.3) contains one contraction of a pair of A operators, and in (80.6) two 
pairs; since the contraction of one pair gives —- Dy, the diagrams (80.5) and (80.7) have 
opposite signs, and this leads to the minus sign in (80.8). 
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constructed analogously to (76.3) and (76.4). Then ôF can be written 


An s=0 


EE AT 5” [ DECC; r, r) de(i [čs |, r) d?x. (80.11) 


We now use (80.11) to determine the forces acting in an inhomogeneous 
medium. The isotropy of the medium has already been assumed; we shall now 
suppose that it is a liquid, so that the change of state at each point (at a given 
temperature) can only be due to a change in the density ọ. 

Let us imagine that the medium is subjected to an isothermal small deforma- 
tion with displacement u(r). The corresponding change in its free energy is 


ôF =— f| f.u dèx, (80.12) 


where f is the volume density of the forces acting on the medium. On the other 
hand, the same change can be determined from (80.11) by expressing the var- 
iations ôFo and ôe in terms of the same displacement vector. Let Po(o, T) be 
the pressure without allowance for the van der Waals corrections for given 
values of 9 and T; the corresponding density of volume forces is fo = — VPo, 
so that 

OF o = f u.VPo d?x. 


Next, the change in the density is related to the displacement vector by the 
equation of continuity 60 = —div (ọu). The change in the permittivity is 
therefore 

de = (de/00) 50 = — (e/o) div (ou). 
Substituting this in (80.11), integrating by parts over the whole volume of the 


body, and then comparing the resulting expression for ôF with (80.12), we 
find 


> ð 
f =—-yPa— zt  o grad [DECC r, r) Sl . (80.13) 
S = gms Al 


i 


This formula enables us, in particular, to determine at once the correction 
to the chemical potential of the body. To do so, we write the condition of me- 
chanical equilibrium f = 0, and use the fact that at constant temperature 


dP,(o, T) = (o/m) duce, T), 
where u0(0, T) is the unperturbed chemical potential of the body (m being the 
mass of a particle). Then the condition becomes ọVu = 0, where 


= at ae Oe 
u = Mole, Ora e Pals; r, r) ðo s (80.14) 


On the other hand, the condition for the mechanical equilibrium of any inho- 
mogeneous body is that the chemical potential should be constant throughout 
the body; it is therefore clear that (80.14) gives this potential. 
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The most complete description of the forces acting in a medium is given by 
the stress tensor o;,, which is related to the components of the vector f by 


fi = 804%. /OXx. (80.15) 


To bring the expression (80.13) to this form, we first rewrite it as 
_ OP T wu, a 
fi = kien z {(- 0p ze) Dü (r, o} 
- ZY eO i Dit n); 


for brevity, the arguments Ẹ, will not be written in the intermediate formulae. 
The first two terms already have the required form. The third term may be 
written as 


T ô 0 r 
= ve y fec gx EM A Dir, r ) 


separating the differentiations with respect to the two arguments of the func- 
tion D;(r, r), and putting r = r’ at the end of the calculation. The calculation 
makes use of the equations (see (79.8)) 


AifDnde, r’) = — 4706, 5(8—-'), 
AgDite, 0’) = —4708y,8(r—1’), 


where 
Ay = e(r) 8i +cUrlim CUrlm = C26(r) 641+ 5 x — 6A 
The resulting equation is (with r = r’) 
Ê pei? feDE Lpi pE 
E Di = 2 Anz [eD + DR] Bx, 2i 
and the final expression for the stress tensor is 
Delis, 
Cik = — Poôĝik— > x {za ete r)— o Moet ee |x 
ae r, r)+ efits, r) DCCs; r, r) 
=z l su DH (s; r, D+ DEC, r, o} ; (80.16) 


The formulae obtained, however, do not yet have a direct physical signifi- 
cance. The reason is that the function D,,(r, r’) tends to infinity as 1/|r—r’ | 
when r’ > r, as is easily seen by means of equation (79.8). This divergence arises 
from the contribution of large wave numbers (k ~ 1/|r—r’|), and is due only 
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to the invalidity of equation (79.8) for k = a. This difficulty can be avoided by 
not making an explicit cut-off at large k. We note that the short-wavelength 
fluctuations have no connection with the effects under consideration, which 
are due to the inhomogeneity of the medium. Their contribution to the ther- 
modynamic quantities at any given point in the body is the same for a homo- 
geneous medium and for a medium that is inhomogeneous but has the same 
value of e(r) at the point. To give the formulae a definite meaning that is in fact 
independent of the nature of the cut-off, we must therefore subtract appro- 
priately. The Green’s function D,,(¢,;r, r) is to be taken as the limit of the 
difference 

jim {Did Es3¥,¥')— Diks; r, r}, (80.17) 


where D,, is the Green’s function of an auxiliary homogeneous infinite medium 
whose permittivity is the same as that of the actual medium at the given point r; 
this limit is not divergent. To avoid further complication of the formulae, we 
leave them in the previous form and treat D, as denoting the difference (80.17). 
Here Po(e, T) is the pressure in an infinite homogeneous medium for given 
values of o and T. 

Both in formula (80.16) and in the equation (79.8) that determines the Green’s 
function D% the properties of the medium occur only through (if), the per- 
mittivity as a function of the imaginary frequency. In this connection it may 
be recalled that the function has a simple relation to the imaginary part of the 
permittivity for real frequencies: 

se ose 2 i w im e(w) 
eli) = 1+— | ore dw (80.18) 
6 
(see ECM, §62). We may therefore say that the only macroscopic characteristic 
that determines the van der Waals forces in a material medium 1s ultimately the 
imaginary part of its permittivity. 

Formula (80.16) has exactly the same form as the expression known in 
macroscopic electrodynamics for the Maxwell stress tensor in a constant 
electromagnetic field, the quadratic combinations of the components E and H 
being replaced by the corresponding functions — Dg and — D7. This analogy 
is not a very profound one, however: it does not signify that for a variable 
electromagnetic field as such there is a general expression for the stress tensor 
in an absorbing medium, containing only the permittivity as a characteristic 
of the medium. In the present case we have not an arbitrary electromagnetic 
field but the thermodynamic-equilibrium intrinsic fluctuational field in the 
medium. 
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§ 81. Forces of molecalar interaction between solid bodies. 
The general formula 


Let us apply the general formulae derived in §80 to calculate the forces of 
interaction between solids whose surfaces are a very short distance apart, 
this distance satisfying only the one condition that it is large compared with 
interatomic distances in the bodies. This enables us to treat the problem macro- 
scopically, regarding the bodies as continuous media and their interaction as 
being brought about by the fluctuational electromagnetic field. The important 
fluctuations are those whose wavelengths are of the order of the characteris- 
tic dimensions of the problem, namely the width of the gap between the 
bodies." 

The suffixes 1 and 2 will denote quantities pertaining to the two solids, and 
3 will denote those pertaining to the gap between them (Fig. 17). The gap will 


3 Re 
O “hs 
Ñ 


Fic. 17. 


be assumed plane-parallel, with the x-axis perpendicular to its plane, so that 
the surfaces of bodies 1 and 2 are the planes x = 0 and x = /, where l is the gap 
width. The force F acting on unit area of the surface of body 2, say, is calculat- 
ed as the momentum flux into the body through this surface. The flux is given 
by the component o,,,, of the electromagnetic stress tensor in the gap, taken at 
x= l. In a vacuum, £ = 1, and the expression for o,,. from (80.16) becomes? 


F = o) =~ X (DB (Eni 1, D+ DECC LD DEC 11) 
+ DE Cn; 1, D+ DEn; 1, D—DE (Ens 1, D}; (81.1) 


in this section the summation suffix will be denoted by n. 

Because the problem is homogeneous in the y and z directions, the functions 
D,(6,3 r,r’) depend only on the differences y—y' and z—z’ (the arguments 
y—y’ and z-z’ are not written out in (81.1)); D(C, q; x, x’) are the Fourier 
components with respect to these variables. Then 


Dion; r, r) = Í Dillons q; x, x) d*q/(2z)?. (81.2) 


t? The results in this section and §82 are due to E. M. Lifshitz (1954). 
+ In the intermediate calculations we put Å = 1,c = 1. 
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For the functions 2,,(£,, q; x, x’), equations (79.8) become (with the y-axis 
parallel to the vector q) 


5. a" ; ; 
(» =g) D(x, x’) = —4n8(x— x’), 


dx? 
wage Dy x sjt D (x, x’) = —4nô(x— x’) 
dx? y\*> dx xy, ’ 
WD xy(X, x')+ig ADyy(x, x'\/dx = 0, 
WD x(x, x’) +ig IDyx(x, x')/dx = —4n8(x— x’), 
where w = (e6°-+g?)"2, e= efit), and x’ acts as a parameter; the components 


D,, = D, = 0,since the equations for them prove to be homogeneous. The 
solution of this system reduces to that of the two equations 


2 

as z) D(x, x') = —4nô(x— x’), (81.3) 
d2 , Anw? 

(2 TA) D, (x, x’) =— at d(x — x’), (81.4) 


and D and D, are then determined as 


iq d 
Dxy(x, x)=— a ax Dy, (x, x’), } 

7 : : (81.5) 
DX, x’) = — = a Dy AX, x’) — pa d(x — x’). 


Here it must be taken into account that, from (79.5), 
D,Ar, t') = Dr’, r), and therefore D,x(q; x, x’) = De —q; x’, x). 
The boundary conditions, corresponding to continuity of the tangential 


components of the electric and magnetic fields, amount to requiring the con- 
tinuity of the quantities Dž, DZ, Di, Di, or, equivalently, of the quantities 


Zi 


Dyk Dik, curly Di, curler Dir. 


Using the first equation (81.5), we find that the following quantities must be 
continuous at the boundary : 


L Dy (81.6) 


Since we have in mind to calculate the stress tensor only in the region of the 
gap, we can immediately take 0 -- x’ < l. Inthe range 0 < x < Ithe functions 
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D „and D,, are determined by equations (81.3), (81.4) with € = 1, w = w; = 
= (¢2+4q?)'”. In regions 1 (x < 0) and 2(x > J), they satisfy the same equations 
with zero on the right (since here x # x’) and with respectively ¢1, wı and £2, 
W2 aS €, W. 

The subtraction needed according to (80.17) amounts to subtracting from all 
functions D, in the gap region their values for £1 = £2 = 1. In particular, 
therefore, we can immediately omit the second term on the right of the second 
equation (81.5), so that in the gap region 


Dese a L (81.7) 


iq d 
Ga w3 dx 


Dry = ~ we dx Tm 

Before going on to solve the equations, we should make one further comment. 
The general solution of equations (81.3) and (81.4) is f-(x—x’)+ft(x+~’). 
Using these equations with (81.7) and the definition of the functions Dj. and 
“DZ, we can show that the parts of the Green’s functions which depend on x+ x’ 
make no contribution to the expression (81.1) for the force. We shall not 
discuss this point further here, as it is already obvious from physical consider- 
ations: putting x = x’ in a solution of the form f*(x+.~’), we should obtain a 
momentum flux in the gap that varied with the coordinate, which would 
contradict the law of conservation of momentum. Henceforward we shall 
therefore include only the expressions for the parts D; of the Green’s functions 


that do not depend on x+ x’. 
Let us now determine the function D,» It satisfies the equations 
(wi—d?/dx*) D2AX, x’) = 0, x< 0, 
(w3— d?/dx*) D,A(x, x') = 0, x >, (81.8) 


(w2— d?/dx*) D(x, x') =—406(x—x'), O<x</. 
Hence we find 


Diz = A, x<0; Diz = BeH, x>l; 
Diz = Cie" t Coe- (2r fws) emil 2-7, O< xal. 


In the last expression we have used the fact that, according to the third equation 
(81.8), the derivative dD „/dx has a discontinuity of 4x at x = x’. Determining 
A, B, Cı and C2 (which are functions of x’) from the boundary conditions that 
D, and d@,,/dx are continuous, we obtain 


4r 
wsd 





27 
Dz = cosh wax x) e-malz-x | Qe x <], 
where 


Aa teens Lota) wat ws) 
(Wi—Ws)(Wo—Ws) ` 
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Subtracting the value of Dz for wy = we = w3 (and 1/4 = 0), we have finally 








Daz = = cosh w3(x— x’). 
Similarly, solving the equation for D,,, we obtain (after the subtraction) 
4 
Dy = ai cosh w3(x— x’), 
n 
Ay = 1—e2mat Est wi) (E2W3 + wa) 


(€1w3— W1)(E2W3— We) 
and, using (81.7), 
Dy = Dy = — (Ascig/€241) sinh w3(x— x’), 
Dx, = — (Ag?/C2w341) cosh w3(x— x’). 


Now, calculating the functions Di, and D4, and then transforming them 
in accordance with (81.2), and substituting in (81.1), we obtain 


re {f sl 1 
F(1) = | W3 (ata) q dq. 
0 


Finally, changing to a new variable of integration p with q = €,+/(p?—1), and 
returning to ordinary units, we have as the expression for the force F on unit 
area of each of the two bodies separated by a gap of width / 


1 


(sit pes) (Set pea) 2ptrl \ 77! aa 
a exp ( c ) | lap, (81.9) 


where sı = y(e&1—1 +p), S2 = V (e2—1+ p°), 6, = 2anT/h, £1 and £c being 
functions of the imaginary frequency w = if,; in this connection it should be 
remembered that e(if) is a positive real quantity that decreases monotonically 
from its electrostatic value £o at Č = Oto 1 at č = œ.t The positive values of 
F.correspond to attraction between the bodies. The integrand in each term of 
the sum (81.9) is positive, and for any given p and ¢, decreases monotonically 


t Formula (81.9) has been derived on the assumption that both bodies are isotropic. Its 
application to crystals therefore depends on neglecting the anisotropy of their permittivity. 
Although this is entirely legitimate in most cases, the anisotropy of bodies causes in general 
a specific effect, namely a torque tending to rotate the bodies relative to one another. 
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as l increases.’ Hence F > 0 and dF/dl < 0, i.e. the bodies (separated by a 
vacuum gap) attract each other with a force that decreases monotonically with 
increasing distance. 

The general formula (81.9) is very complicated. It can be considerably sim- 
plified, however, because the influence of the temperature on the interaction force 
is usually quite unimportant.* The reason is that, because of the exponentials 
in the integrands in (81.9), only those terms are important in the sum which 
have 6, ~ c/l or n ~ ch/IT. In the case /T/ch« 1 the important values of n 
are therefore large, and in (81.9) we can change from summation to integration 
over dn = hdl /2nT. The temperature then disappears from the formula, and 
the result is 


oo 





ne 2ptl\  ,)7 
cei a c ) J | ap at. aa 


According to the above discussion, this is valid for distances /« ch/T, and even 
at room temperatures the distances concerned are up to about 1074 cm. 
Formula (81.10) can be considerably further simplified in two limiting cases. 


§ 82. Forces of molecular interaction between solid bodies. 
Limiting cases 


Let us first consider the limiting case of “small” distances, by which we mean 
distances small in comparison with the characteristic wavelengths 2o of the 
absorption spectra of the bodies concerned. The temperatures that may be in 
question for condensed bodies are always small compared with the fiwo im- 
portant here (for example, in the visible spectrum), and the inequality /T/ch « 1 
is therefore always satisfied. 

Because of the exponential factor in the denominators of the integrand, the 
important range in the integration with respect to p is that where p¢lje ~ 1. 
Here p > 1, and therefore we can put sı ~ S2 ~ p in determining the principal 
term in the integral. In this approximation, the first term in the braces in (81.10) 
is zero. The second term, with a new variable of integration x = 2pCl/c, gives 


PF feat Det). 7 
FO = -gar | (x X e De h7! as C2) 
0 0 


t This is easily seen by noting that for s= 4/(e— 1 + p?) (and p = 1) we have the inequalities 
ep > s > pwhene > 1. 

t In speaking of the influence of the temperature, we are not referring to that due simply to 
the temperature dependence of the permittivity itself. 
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in this approximation, the lower limit of integration with respect to x is re- 
placed by zero.! 

In this case, the force is inversely proportional to the cube of the distance, 
as we should expect in accordance with the usual behaviour of the van der Waals 
forces between two atoms; see the next footnote. The functions e(i¢)— 1 decrease 
monotonically with increasing ¢ and tend to zero. Hence the values of Ẹ be- 
yond some £o no longer contribute significantly to the integral; the condition 
for / to be small means that we must have l < c/Co. 

We shall show how the change can be made from the macroscopic formula 
(82.1) to the interaction of individual atoms in vacuum. To do so, we formally 
assume that both bodies are sufficiently rarefied. Macroscopically, this means 
that their permittivities are almost unity, i.e. that ¢;—1 and e2.—1 are small. 
From (82.1) we then have with the necessary accuracy 


a | | xe *(e1— 1)(£2— 1) dx dl 


0o o0 


F = 


oe f [ex(it)— leat) — 1] ae. 


0 


Expressing e(i¢) in terms of im e(w) on the real w-axis, by (80.18), we obtain 


w102 iM E(w) IM E2(w2) 
=n M come -< A22 dt dwr dws 


(cof +0) (03+ C?) 
” aus (| f im €1(@1) im alwe) e(o) im tale} LA (82.2) 


This force corresponds to an interaction of atoms with energy 


im €3(@1) im &2(@2) da 


3h 
U(r) =— Saimin? {| mea dwz, (82.3) 


t An integral of the form 





a . Í x dx 
2 ae*— 
0 
varies only slightly, from 1 to 1.2, when a varies from oo to 1. We can therefore write (82.1) 
with sufficient accuracy in practice, as 


O AGP eit) — I fea) 1) 
8B? a= | leat) + 1) [e2(#2) + 1] d. 


The quantity @ acts as a frequency characterizing the absorption spectra of the two bodies. 
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where r is the distance between the atoms; ni, ng are the number densities of 
atoms in the two bodies.* This formula agrees with London’s formula in quan- 
tum mechanics, which is derived by applying ordinary perturbation theory to 
the dipole interaction of two atoms (see QM, §89, Problem). In making the 
comparison, it must be borne in mind that the imaginary part of e(w) is related 
to the spectral density of “oscillator strengths” f(w) by 


wim &(w) = (27°%e?/m) nf (©), 


where e and m are the electron charge and mass; see ECM, §62. The oscillator 
strengths are expressed, in the usual manner, in terms of the squared matrix 
elements of the dipole moments of the atoms; see QM, (149.10). 

Let us now turn to the opposite case of “large” distances, ] >> 2o. We shall, 
however, suppose that the distances are still not so large as to violate the in- 
equality /T/fic « 1. 

In (81.10) we again use a new variable of integration x = 2p/t/c, but leave 
p and not € as the second variable. Then ¢1 and ez are functions of if = ixc/2pl. 
On account of the factor e* in the denominators of the integrand, the important 
values of x in the integral with respect to x are ~ 1, and since p = 1 the argu- 
ment of e for large / is almost zero throughout the important range of values of 
the variables. Accordingly, we can replace £, and ex simply by their values for 
E = 0, i.e. the electrostatic permittivities ¢19 and ego. Thus we have finally 


(si0+p)(S20+p) on 
= a f j = {| pene J 
po = pe1o)(Sz0— pezo) - ai je k 


S10 = y (£&10— 1+p?), S20 = y/(E20— 1 + p°). 


(82.4) 


The law of decrease with distance as /~* here corresponds to the decrease of 
the van der Waals forces between two atoms with allowance for retardation 
(see below). 

Formula (82.4) reduces to a very simple expression when both bodies are 
metals. For metals, e(if) — œ as € + 0, and we can therefore take £o = æ. 
Putting £10 = £20 = œ, we obtain 


_ x dp dz m2 fc 
= r= saa | Pe) ~ m0 E (2) 


t If the potential energy of the interaction of atoms 1 and 2 is U(r) = —ar™ °, the total 
energy of pair interactions of all the atoms in two half-spaces separated by a gap of width / 
is Uwt = ~ annıng/ 12l. The force is F = dUwydl = ann,n,/6f°. This is the correspondence 
between (82.2) and (82.3). 
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(H. B. G. Casimir 1948). This force is independent of the nature of the metals, 
a property which does not hold at small distances, where the interaction force 
depends on the behaviour of the function (it) for all values of ¢ and not only 
at = 0. 

Figure 18 shows a graph of the function @,(¢0), which gives the attractive 
force between two identical insulators (e10 = €20 = £0); formula (82.4) is 
written as 





F= a f a a) PilEo)- (82.6) 





re) 02 04 06 O8 If, 
Fic. 18. 


The same diagram shows the function ¢,,,(eo), which gives the attractive force 
for an insulator and a metal (£10 = £o, €20 = ©), from the formula‘ 


m? fic Eo— 
F= EE) Alne (82.7) 
In (82.4) we make the transition to the interaction of individual atoms, as was 


done above for formula (82.1). For small co— 1, we have 
So—p © (€o—1)/2p, so—peo © (€o—1)(—p+ 3P~), 
and the integral (82.4) becomes 


fic 2 1—2p?+2p* 
F = -yje (6101) (620 Y | ezda | TI dp, 
1 


whence 


F= e ar (e10— 1)(€20—1). (82.8) 


t As € ~ 1, the functions pu and @im tend to 0.35 and 0.46 respectively, corresponding to 
the limiting forms (82.8) and (1) in the Problem. As £ — co, both functions tend to unity, cor- 
responding to (82.5). 
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This force corresponds to the interaction of two atoms with energy 


one 1%, (82.9) 


UO == A4nr* 


where «; and a» are the static polarizabilities of the atoms (co = 1+4sna). 
Formula (82.9) agrees with the result of the calculation in quantum electro- 
dynamics for the attraction of two atoms at sufficiently large distances, when 
retardation effects become important (see ROT, §85). 

Lastly, let us consider distances so great that /7/fic >> 1, the opposite inequal- 
ity to that required for the effect of the temperature to be negligible. In this 
case only the first term in the sum (81.9) need be retained, but we cannot 
immediately put n = 0 in this term, since there is an indeterminacy: the factor 
¢3 is zero, but the integral with respect to p diverges. This difficulty can be 
avoided by at first using instead of p a new variable of integration x = 2pC,I/c 
(so that the factor C3 disappears). Then putting ¢, = 0, we obtain 


oo 


EE A (e10+1) (e2 +1) p -1 
«163 f j eoe 1| dx. (82.10) 


0 
Thus, at sufficiently large distances the decrease of the attractive force becomes 
slower, and returns to the /~3 law, but with a coefficient that depends on the 
temperature; all subsequent terms in the sum in (81.9) decrease exponentially 
with increasing /. The condition /T/fic > 1 is essentially the classicality condi- 
tion (iw < T, where w ~ c/l). It is therefore natural that the expression (82.10) 
does not involve &." 


PROBLEM 


Find the law of interaction of an atom with a metal wall at “large” distances. 


SOLUTION. The interaction of an individual atom with a condensed body can be found by 
considering only one of the bodies (labelled 2, say) as a rarefied medium. Regarding é2)— 1 
as small and putting £19 = oo, we obtain from (82.4) 


_ ficlen—1) f aa, f dp _ 3helew—1) 
Fase | Pee | Gan 
0 1 


(1) 


If the atom-wall interaction energy is U = —aL~‘ (where L is the distance from the atom to 
the wall), the energy of interaction of atoms in a half-space separated from the wall by a gap / 
is Uws= —an/3l®, and the force F = dU,./d! = an/I*. Thus the value found for F corre- 
sponds to the attraction of an individual atom to the wall with the energy 


U(L) = —3a,fic/8L* (2) 
(H. B. G. Casimir and D. Polder 1948). 


t The formulae obtained in §§81 and 82 can be generalized to include the case where the 
gap between the solids is filled with a liquid and the case of a thin liquid film on a solid sur- 
face; see I. E. Dzyaloshinskii, E. M. Lifshitz and L. P. Pitaevskii, Soviet Physics Uspekhi 4, 
153, 1961 (= Advances in Physics 10, 165, 1961). 
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For the interaction of an atom with an insulating wall, the same method gives 


3fich, Ey9—1 


UL) =~ nL eti 


Dat (£10) 








with the function },; shown graphically in Fig. 18. AS €19 > 1 this function tends to the value 
23/30 = 0.77, corresponding to (82.8). 


§ 83. Asymptotic behaviour of the correlation function in a liquid 


Long-wavelength electromagnetic fluctuations lead also to certain specific 
properties of the correlation function of the density fluctuations in a homo- 
geneous liquid. 

The correlation function »(r) is determined (see Part 1, §116) in terms of the 
mean value of the product of the fluctuations of the particle number density 
n at two points in space by 


(6n(ri) ôn(r2)) = n6(r)+-7(r), r = r1—ro. (83.1) 


The correlation function is related to the interaction between particles, and 
its asymptotic behaviour at large distances is determined by the long-range van 
der Waals part of this interaction. Hence »(r), like the van der Waals forces, 
decreases as an inverse power of the distance (J. E. Enderby, T. Gaskell and 
N. H. March 1965). 

This, of course, also affects the properties of the Fourier components of the 
correlation function »(k) = »(k). If the only forces acting between particles 
in the liquid had a range of the order of the atomic dimensions a, »(r) would 
decrease exponentially with increasing distance, the exponent being ~ r/a.” 
In terms of Fourier components this means that »(k) would be a regular function 
of ka and could be expanded in even powers of ka when ka<« 1. The long- 
range forces, however, cause the occurrence in »(k) of a term »(k) that varies 
considerably even in the range k ~ 1/4» (not k ~ 1/a), where Ao (>>a) denotes 
characteristic wavelengths in the spectrum of the liquid. In the range ka « 1, 
the parameter kAy may be either small or large; the function »,(k) has a sin- 
gularity in this range. 

To calculate the correlation function, we use its relation to the second varia- 
tional derivative of the free energy of the body with respect to its density. By 
definition, this derivative is the function (r) that occurs in the expression 


ÔF = + | O(\ri—re}) dn(t1) ôn(r2) d3x1 d3xz (83.2) 


for the change in the free energy due to density fluctuations (at a given 


t The liquid under consideration is at a temperature T ~ @, where O ~ huja is the 
“Debye temperature” of the liquid, and far from the critical point. Near the critical point, the 
correlation radius increases indefinitely (see Part 1,§§152, 153). It also increases at low temper- 
atures, and for T < @ it is of the order of fiu/T (see §87 below). 
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temperature). The Fourier component ¢(k) = $(k) of this function is related 
to the required function »(k) by 


(k) = 1; (83.3) 


we —_ 

ng(k) 
see Part 1, (116.14). We mvst emphasize that this formula assumes the fluctua- 
tions to be classical, for which it is necessary that iw «T, where w is the 
frequency of oscillations with wave number k. With w ~ ku (where u is the 
velocity of sound in the liquid) we get the condition 


ku «T, (83.4) 


corresponding to distances r >> hu/T. 

The “regular” part of the function $(k), due to the short-range forces, can 
be expanded in powers of k; taking (when ka « 1) only the first term of the 
expansion and denoting it by b, we can write 


p(k) ~ b+¢1(k), (83.5) 


where $,(k) is the “singular” part that is now of interest. Because the van der 
Waals forces are relatively weak, @3(k) « b, and so the result of substituting 
(83.5) in (83.3) can be put in the form 


x(k) = = -1 5 ġı(k). (83.6) 


Since »(k) and ġı(k) are linearly related, the function v(r) at large distances is 
simply 
v(r) = —(T/nb?) (r). (83.7) 

The first (k-independent) term in (83.6) corresponds to a coordinate function 
constant X 6(r) due to the short-range forces (if their range of action is regarded 
as negligible). 

To determine (r), we start from formula (80.11) for the variation of the 
free energy. Putting there 


8e(it, r) = oe A=) Sn(r), (83.8) 


we see that the expression 


H C2Dn(Es; r, r) oe 





ar, 


t The constant b is expressed in terms of the thermodynamic quantities for the liquid by 
b = (1/n)\(OP/0n) 7p; see Part 1, §152. 


§ 83 Asymptotic Behaviour of the Correlation Function in a Liquid 349° 


is the first variational derivative of the free energy with respect to the density. 
For the second differentiation we must vary this expression in turn, obtainingt 


-5 Anrhe 2 sô Dults; r, r) ——— h (83.9} 


The function ‘D itself satisfies equation (79.8): 


E re ite | 
ae Oxi — ĝuA + P E(il sy r) ôn Dubs; r,r ) = —4nhôiô(r—r ), (83.10) 
and its variation gives the equation for the variation of D: 
02 2o , 
ec a iA +e &(ils) ôu] ôDik(ts; r, 0’) = -2 =e Befit s r) D(t; r, r’). 
(83.11) 


The solution of (83.11) may be written down at once by noticing that, from 
(83.10), the “unperturbed” function D,, is the Green’s function of this equa- 
tion; hence 


bDix( Es; r, r’) = f be(ils, r”) Du Cs3 r”, r) Dufte; r”, r) dex"; 


c3 
Anfic* 
here we have also used the fact that D,(r, r”) = D,(r’, r). Finally, substituting 
(83.8) here and the result in (83.9), we obtain the second variational derivative 


T ðe (il, 
oi(r) = TAAA , X te € (is) 





| Dink Es5 r1, r2), (83.12) 


with r = |ri—re|. This formula together with (83.7) gives the required general 
expression for the correlation function »(7) when r >> fu/T (M. P. Kemoklidze 
and L. P. Pitaevskif 1970). 

The condition (83.4) already assumed previously for the wave numbers is. 
equivalent to r >> fiu/T for the distances. If, simultaneously with this condition, 
we restrict the range of values of r by an upper limit also: 


he|T =r >> hu/T, (83.13) 
then large values of s are important in the sum, and the summation over 


discrete “frequencies” C, = 2nTs/h can be replaced by integration over 
ds = hdl /2nT: 





T fT 1 deity? 
v(r) = Pack | ce sd | CA DFC C 3 ris rye. (83.14) 


0 


t Only the function Du is varied. Varying € would lead to a term of the form constant 
x 6(r) in P(r), which does not relate to the long-range forces. 
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The function Dm is obtained from (77.6) by the substitution œ — it. Carrying 
out the differentiation and squaring, we have 


2 
Dim = em (142 z EAA a +a): 


ws" wi (83.15) 
w = rl +/e(il)/c. 


Substitution of (83.15) in (83.14) gives a fairly complicated expression, but 
this becomes simpler in two limiting cases. 

For “small” distances (r « Ao; cf. §81) the important range in the integral 
is € ~ c/Ao; then r¢/c «1, so that we can replace the exponential factor in 
(83.15) by unity, and keep only the last term in the brackets. We then find 


A , AT ( defit) 72 dt 
y(r) = 7’ A = Terabe | a eID) 9 re ho. (83.16) 
0 
The Fourier transform of this function ist 
v(k) = mPAKS/12, kg 21. (83.17) 


In the opposite case of “large” distances (r >> 29) the important range in 
the integral is Ẹ ~ c/r < c/o ~ wo. We can therefore replace (if) by its 
electrostatic value o and take (@¢0/0n)? outside the integral in (83.14). The 
integration is then elementary (and all the terms in (83.15) make contributions 
of the same order of magnitude). The result is 


23hcT £o \? 
= 7 a p aa 
W) = Bir’, B= ars ( = , ræ. (83.18) 


The Fourier transform of this function is 
v(k) = — (72/30) Bk4 log kAo, kho « 1. (83.19) 


§ 84. Operator expression for the permittivity 


In this section we shall derive a useful expression for the permittivity of a 
medium in terms of the commutator of the charge density operator (P. Noziéres 
and D. Pines 1958). This formula is analogous to Kubo’s formula, taking 
account of the specific nature of the electromagnetic field. 


t By direct integration in spherical polar coordinates in k-space, we obtain 


; d3k I'(v+2) sin 4v 
= ik .r—Ak = Pea E D 
L= a inr e ky — Cn)? Inip t3 


The integral needed to verify (83.17) is J;. The integral needed to verify (83.19) is dl, |dv 
with vy = 4, 
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We shall consider a homogeneous medium having both time and space dis- 
persion of the permittivity. This means that the induction D(t, r) depends on 
the values of the field E(t, r) not only at previous times but at other points in 
space. Such a dependence can be generally represented as 


D(t, 1) = Et,t)+ f [fat r) Ett, 9-1) d?x' de. (84.1) 
0 


For a monochromatic field in which E and D œ exp [i(k.r—wt)], this relation 
becomes 
D; = Eix(O, k) Ex, (84.2) 


where 


Eix(@, k) = bit f f fut, ¥’) e0 0) d3x' dr, (84.3) 
0 


We shall take only the case where the medium is not only homogeneous but 
also isotropic and without natural optical activity. Then the permittivity remains 
a tensor, but one that contains only the vector k. The general form of such 
a tensor Is 
kikk 

k? 





Eik = E(w, k) 





+ e(o, k) (èx- sae) , (84.4) 
The scalar functions ç and ¢, are called respectively the longitudinal permittiv- 
ity and the transverse permittivity. If E is a potential field, E = —Vv@, then for 
a plane wave it is parallel to the wave vector (E = —ikd@) and then D = &E. If 
the field is solenoidal (div E = ik.E = 0), E is perpendicular to the wave vec- 
tor, and then D = ¢,E. 

With this description of the properties of the medium, there is (cf. ECM, 
§83) no significance in dividing the mean microscopic current density ev (0 being 
the charge density) into two parts OP/Ot and c curl M, where P is the electric 
polarization and M the magnetization of the medium. Thus Maxwell’s equa- 
tions are 

1 OB 1 oD 


curl E TO ip curl B = -57 > 


without the introduction of the vector H as well as the magnetic induction B 
which is the mean microscopic magnetic field. All terms resulting from the aver- 
aging of microscopic currents are assumed to be included in the definition 
D = E+4aP, ov = ôP/ôt. 

The longitudinal permittivity is important in applications, and we shall derive 
an operator expression for it. This is found by considering the response of the 
system to an external (i.e. generated by sources outside the system) potential 
electric field E,, = — Y Pex 
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The operator of the interaction of the system with this field is 
v= f ôlt, r) dex(t, r) d5x, (84.5) 


where 6(t, r) is the charge density operator in the system. Comparing this expres- 
sion with the general formula (75.8) and regarding ¢,, as a “generalized force” 
f, we immediately find from formulae (75.9)-(75.11) that the Fourier compo- 
nents with respect to time of the mean charge density are 


O.(r) = -5 f [vee r) 6(0, r')— 6(0, r’) O(t, r)) GZ) d3x’ dt. 


Changing also to spatial Fourier components and using the fact that, since the 
system is homogeneous, the mean value of the commutator depends only on 
r—r’, we obtain 

Buk = (w, k) Ook; (84.6) 


where 


ao, k) = — f f eot-k- (r, x) Ô(O, 0)— (O, 0) ôC, x) d°x dt. (84.7) 


The mean charge density is related to the polarization vector of the medium 
by 6 = — div P (see ECM,§6). Hence, for the Fourier components, 


ok =—ik. Pok = i(e;— 1) k. E.,/42. 


On the other hand, Afex = —47¢,,, where ¢,, is the density of the charges that 
create the external field; the induction D is related to this charge density by 
div D = 4r., From these two equations we find 


ok = (4/k°) ook = (ie1/k?) k . Eoy. 


Finally, substituting these expressions in (84.6), we obtain the required expres- 
sion for the longitudinal permittivity : 


A sia al, k). (84.8) 
In (84.7) 6(¢,r) should be taken, strictly speaking, as the charge density operator 
of all particles in the system, both electrons and nuclei. Usually, however, the 
electrons make the principal contribution to the permittivity throughout the 
important range of values of w and k; we can therefore take 6 as e(A—ñ), where 
Ais the electron density operator and ñ its mean value. 
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Formulae (84.7) and (84.8) can be further transformed by expressing them 
in terms of the matrix elements of the Fourier components of the operator 6. 
To do so, we first rewrite (84.7) as 


a(o, K) =— ay | eal) OAO Dat, (849) 


where V is the volume of the system. The matrix elements of the Heisenberg 
operator 6,(¢) are expressed in terms of those of the Schrödinger operator by 


(0x(t)) mn T3 elma} Or) mn 


Expanding the product of operators by the matrix multiplication rule and inte- 
grating according to (31.21), we have finally 


ck een, E E 2 1 1 
E(w, k) =1+ hk2V » | (Cx)no| | saath Ss ToT | ; (84.10) 


where the suffix 0 refers to the given state for which the permittivity is sought. 
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Let us consider a fully ionized plasma, in which the ions form a classical 
(Boltzmann) gas and the electron component is degenerate. For this, the temper- 
ature must satisfy the conditions u «< T S u, i.e. 


hrBlm<«T S Anhima, (85.1) 


where u, and pu, are the chemical potentials of the electrons and ions in the 
plasma, m, and m, the electron and ion masses, n the particle number density; 
in making estimates, we do not distinguish between n, and n,. We shall also 
suppose that the plasma is almost ideal. For this to be so, the energy of the 
Coulomb interaction between two particles at a distance /~n~* apart must be 
small in comparison with their mean kinetic energy £. For ions e ~ T, and 
for electrons e ~ u, ~ n?Ph2/m,. Hence we have the conditions 


mee*|h? < n « Tje. (85.2) 


It has been shown in Part 1, §80, that under these conditions the chief source 
of corrections in the thermodynamic quantities of the plasma (as compared with 
their values for an ideal gas) is the exchange interaction of the electrons; the 
energy of this interaction (per unit volume of plasma) is ~ e2/n**. The corre- 
lation correction (the main one in a classical plasma) is small in a degenerate 
plasma, in the ratio ņ™” relative to the exchange correction, where n= 
m,&@|h?nY3 « 1. Nevertheless, its calculation for a degenerate plasma is of 
methodological interest, and affords an instructive illustration of the use of the 
diagram technique. 
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The Coulomb interaction operator of the plasma particles is 


= se 2 PiP el Viet. dx dx’, (85.3) 
where the suffixes aand b label the different kinds of particle (electrons and var- 
ious ions); z,e is the charge on a particle (for electrons, z, = — 1). Taking the 
y operators in the Matsubara representation, we obtain the interaction operator 
in that representation. The diagfam technique for calculating the mean value 
(P) (over the Gibbs distribution) is then carried through in the usual manner 
by changing to the interaction representation for Matsubara operators; the 
resulting perturbation-theory series is an expansion of (ÑY in powers of e2. 

The expression (85.3) contains no “free” variables (i.e. variables over which 
there is no integration). In the diagram technique, this is expressed by the fact 
that the terms in the perturbation-theory series for (ÑY are represented by dia- 
grams having no free external lines. The broken lines in these diagrams, with 4- 
momenta Q = (€,, q), will be arbitrarily associated with factors’ 


—$(q) = — 42/9? (85.4) 


(which are independent of ¢), i.e. minus the Fourier components of the poten- 
tial ġ(r) of the field of a unit charge. The continuous lines must now be assigned, 
together with the 4-momentum P = (Č, p), an additional suffix a which indi- 
cates the type of particle, and each such line is associated with a factor — Gos (P), 
which is minus the Green’s function of the free particles a. The continuous lines 
in the diagram form closed loops, each containing sections with the same a. 
Each vertex of the diagram (a point of intersection of a broken line with con- 
tinuous lines of type a) is associated with an additional factor z,e. Each fermion 
loop contributes an extra factor of — 1. The diagrams constructed according to 
these rules give terms in the expansion of 


—(2/V)(P). (85.5) 


The factor V in the denominator is the volume of the system. This factor arises 
because the integrand in each term of the series depends only on the differences 
of the coordinates, and therefore one of the integrations over d°x gives simply 
the volume V. The minus sign in (85.5) results from the determination of the 
broken lines by the rule (85.4), i.e. with the minus sign before ¢(q). The factor 2 
results from taking the factor + in (85.3) to the left-hand side. 

In the first order of perturbation theory, there are diagrams of two kinds: 


(a) Y (b) (i) (85.6) 


t In the rest of this section we put A = 1, c = 1, and e (> 0) denotes the unit charge. 
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with all possible a and b. Those of the form (85.6a) arise from contractions of 
y operators taken at the same point in space. These diagrams correspond to the 
direct Coulomb interaction of particles a and b distributed uniformly in space; 
their contributions cancel out on summation over all pairs a, b, because the 
plasma is electrically neutral. Diagrams of the form (85.6b) arise from contrac- 
tions of y operators with different arguments, and correspond to the exchange 
interaction of particles of one type a. The calculation of this diagram leads to: 
the results already obtained in Part 1, §80. 
In the next order, diagrams of the following kinds occur: 


a a OO 
bi |b o o {Y 1 2 (85.7) 
a C ' 


(a) (b) (c) (d) le) 


Diagrams (85.7a, b) are corrections to (85.6a) and for the same reason cancel 
on summation over all a, b, c. Diagrams (85.7c, d) are small corrections to the 
exchange interaction energy and are of no interest here. 

Diagram (85.7e) is “anomalously large” because the corresponding integral 
diverges. This divergence occurs because the momenta q of the two broken lines 
in the diagram are the same (as is obvious from the conservation of momentum 
at the vertices). Hence the diagram contains the integral f d3q/q*, which diver- 
ges as 1/q when q is small. 

In subsequent approximations there occur (as well as correction diagrams) 
new “ring” diagrams with even stronger divergence. For example, the third- 


order diagram 
a 


A> 
e (7i s 
b C 
ANS. 
with three broken lines having the same momentum q contains the integral 
f q~? d°q, which diverges as q~?. In general, the ring diagram of order n, formed 
by n continuous loops joined by n broken lines, diverges as g~ "—3), 
The summation of the infinite sequence of ring diagrams leads, as we shall 


see, to an effective cut-off of the divergences at values of g of the order of small- 
ness of e; hence all these diagrams together give a contribution to (V) that is of 
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the order of smallness of (e?)"/e2"~* = e3. This contribution is represented 
graphically by the sum (over kinds of particle) of the skeleton diagrams 


ae, (85.8) 


where the thick broken line represents the sum of the infinite set of linear 
diagrams 


a b 
ta E OO oa (85.9) 
0, Byars a b 


with various numbers of continuous loops. 

Whereas the thin broken line represents the potential ¢@ of the Coulomb 
field of an isolated charge, the thick one represents the potential (which we 
denote by ®) of the field perturbed by the polarization of the surrounding 
plasma. The total contribution (85.8) therefore gives the required correlation 
part of the mean interaction energy in the plasma. 

We use the notation —PD(C,, q)/4z for the sum of simple continuous loops of 
all types of particles, and denote this quantity graphically by a white circle: 


g 
== SO (85.10) 
a 


The argument ¢, of this function takes “even” values ¢, = 2sa7, whatever the 
Statistics obeyed by the particles a: by the law of conservation of frequencies at 
the vertex, this argument is equal to the difference of the frequencies of the 
two continuous lines, which is “even” for both “even” and “odd” terms. 

With the notation (85.10), the sum (85.8) is represented by one skeleton 
diagram 


2<Poer = (| (85.11) 


The thick broken line itself satisfies the diagram equation 
meme = = t Oe (85.12) 


which is exactly analogous to (14.4) and (79.13). In analytical form, this equa- 
tion is 


-Dn D = -00-00 PEE ot, o, 
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whence 
4r 
g’—D(Es,4) 


It is useful to regard these formulae from a somewhat different viewpoint, 
in order to establish the connection with the diagrams in §79. The Coulomb in- 
teraction between charges may be treated as resulting from the exchange of 
virtual photons. Here, however, it is more convenient to use not the gauge 
(75.1) but the Coulomb gauge (see ROT, §77), in which — Doo is the Fourier 
component of the Coulomb potential. The spatial part D,, in this gauge de- 
scribes the retardation and the magnetic interaction, and can be neglected in a 
non-relativistic plasma. We may therefore suppose that the broken lines in 
(85.11) correspond to the Matsubara Doo, and that the function D is just the 
component Doo of the polarization operator. According to (79.18) we can there- 
fore write 


D(t q) = (85.13) 


DÈ., Q = -glei | Le, )— 11; 


it is easy to see that the longitudinal permittivity e, occurs in (79.18) when there 
is spatial dispersion. Substituting this expression in (85.13), we find 


— ma 
Pekiles|,q) ° 


i.e. the Fourier component of the potential of a unit charge in the medium, as 
it should be. 

Expanding the diagram (85.11) by the general rules of the Matsubara tech- 
nique, we find 


D(a Q) = (85.14) 





i (os, 
(P Yeot = T3 TÈ Da $(q) D(C, W535. oe 
ee Presa) dq 
7 STS A DENT EF ii 


We shall see later that the term with s = 0 is the most important in the sum, 
and the corresponding integral is governed by the region of small q. Hence, 
in calculating (85.15), it is in practice sufficient to know the limiting value of 
(D(0,q) as q +0. This quantity is easily determined from simple physical consid- 
erations without any direct calculation from the diagrams (85.10). 

When ¢, = 0, the function (0, q) is the Fourier transform of the potential 
D(r) of the electrostatic field of a unit charge in the plasma. The unperturbed 
potential P(r) satisfies Poisson’s equation with a delta function on the right- 
hand side: Ad = — 426(r). The equation for the potential Ø perturbed by the pola- 
rization of the plasma is found by adding on the right-hand side the change 50 
in the charge density in the plasma caused by the field itself: 


A® = —4n[6(r)+ 80]. (85.16) 
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On the other hand, as q — 0 we have a field that varies only slowly through the 
plasma volume. In such a field, the thermodynamic equilibrium condition 


holds: 
Uat eZa® = constant = u®, (85.17) 


where u, is the chemical potential of particles of type a, and u its value in the 
absence of the field. From this condition we find for the change in the particle 


density n, 
Ôa = (OalONa)r,v Na = — eZ 


and hence for the change in the charge density 


Ôo = Ý ez,6ng = — Y (€2a)(Ona/OUa)r,v P. 


Substituting this expression in (85.16), we obtain the equation 
AD—#D = — 4n6(r), (85.18) 
with 
x? = Ane? S 22(Ona/Ous)r, v- (85.19) 


It is seen from (85.18) that 1/x is the Debye radius of the field screening in the 
plasma (cf. Part 1, §78). Finally, taking the Fourier component of each side 
of (85.18), we find 

Da) = 4n/(q?+x*), 


and a comparison of this with (85.13) gives 


IPO, Dla — o = — x. (85.20) 
Now integrating in (85.15) with this value of D, we have 
_ VT [ Ang’?dq _ VTx3 
Vy ecr = 22x) | GEFA => “ga . (85.21) 


First of all, we note that the integral converges at the lower limit and that the 
most important range is q ~ x. For the non-degenerate ion component of the 
plasma, ðn;/ðu; = n,/T; for the electrons, On,/Ou, ~ ne/le. It is easily seen 
that, from the conditions (85.2), x < n3, and so q « n", i.e. 1/q is large com- 
pared with the distances between particles. This justifies the use of the equilib- 
rium condition (85.17). To justify the neglect of all terms in the sum in (85.15) 
except that with s = 0, we note that by (85.14) the polarization of the plasma 
at non-zero frequencies is described by the permittivity ¢(w,q). According to 
the known asymptotic formula for high frequencies, ¢(w) ~ 1—420n,¢2/m,w?, 
and therefore 
E(t | Gs ) =] +4n,e?/m.C*; 
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see ECM, §59. From the conditions (85.1), (85.2), all the non-zero frequencies 
t, = 2snT >> (n,e?/m,)'”, and for them we can therefore take e(i|¢,|) = 1, i.e- 
the plasma is not polarized and P is small. 

Formula (85.21) is expressed in terms of the thermodynamic variables T, V, 
ua. Hence the thermodynamic potential Q of the plasma can be found by direct 
integration of the equation 


(02/8e%)r, v, ua = (Ve; (85.22) 


see Part 1, (80.4). The result for the correlation part of Q is (in ordinary units) 


5 Ua/nV Te 3/2 
Dasa E = ATE p 2 (sr | (85.23) 
a V,T 








127 Olla 


(A. A. Vedenov 1959). According to the general theorem of small increments, 
the same formula expressed in terms of other thermodynamic variables gives 
the correction to other thermodynamic potentials. 

For a non-degenerate plasma, all the derivatives 0n,/Ou, = n,/T, and (85.23) 
then becomes 
2Vave 


Feor = — — a 34/T 


(È 2n z?n a” (85.24) 


for the correction to the free energy, the same as in Part 1, (78.12). 

For strong degeneracy of the electrons in a plasma (T « p,), the derivative 
n,/Ou, ~ n./Ue<«n,/T. In the sum over a in (85.23) we can then neglect the 
electron term, and return to (85.24) with the difference that the sum is taken 
only over the kinds of ion in the plasma. Thus with strong degeneracy the elec- 
trons have no influence on the screening length or on the correlation part of the. 
thermodynamic quantities in the plasma. 


CHAPTER IX 


HYDRODYNAMIC FLUCTUATIONS 


§ 86. Dynamic form factor of a liquid 


THE correlation function of density fluctuations, discussed in Part 1, §116, is a 
particular case of a more general function which relates the density fluctuations 
not only at different points in space but also at different times. In the classical 
theory, this function is defined as the mean value 


no(t; ¥1,¥2) = (On(t1, r1) n(te, re)), (86.1) 


where t = t — ta; the factor nm = N/V, the mean number density of particles, is 
taken outside the definition of ø. For a homogeneous isotropic medium (a liq- 
uid or a gas) the function (86.1) depends on rı and rz only through the dis- 
tance r = |r;—re| between the two points, and this will be assumed in what 
follows. 

In the quantum theory, the corresponding function is defined by means of the 
symmetrized product of time-dependent (Heisenberg) density operators as 


nG(t, r) = $ (OA(t1, r1) A(t, r2)+ OA(f2, r2) 5A(t, r1), (86.2) 


in accordance with the general definition as in Part 1, (118.4). There are certain 
advantages in the present case, however, in using the asymmetric definition 


ño(t, r) = (6A(t, r1) OA(to, r2)) (86.3) 


for which we retain the notation o(t,r).’ Unlike &(t, r), o(t, r) is not an even 
function of t; it is evident that 


6(t,r) = tlelt, r)+o(—~t, r)]. (86.4) 


The Fourier transform of the function o(t, r) with respect to time and coordi- 
nates, 


a(o, k) = ofo, k) = | f eeteno(t, r) dt dx (86.5) 


t It is this function which is a directly observable quantity, for example, in inelastic scatter- 
ing of neutrons in a liquid (see Problem). 
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is called the dynamic form factor of the medium. Since o(t, r) is isotropic, the 
form factor depends only on the magnitude of the wave vector. It follows from 
(86.4) that the Fourier transform of &(t, r)is 


&(w, k) = 4 [o(@, k)+o(— o, k)]. (86.6) 


The purely spatial correlation of the liquid density fluctuations is determined 
by (86.1) with t = 0: o(r) = a(t = 0,r) = &(t = 0,r). This function is related 
to »(r), defined in Part 1 (§116) and used in §83, by o(r) = o(r)+ ô(r); the Fourier 
transforms are such that o(k) = »(k)+1. The function o(k) or x(k) is called the 
static form factor of the liquid. The functions o(, k) and o(k) are related by the 
integral formula 


ae | f ow, k) e-iot d z f A HZ. (86.7) 
t=0 


The Schrödinger (time-independent) density operator is given by the sum 
A(r) = F ô(r—ra), (86.8) 


taken over all the particles in the medium; the coordinates r, of the particles 
act as parameters; cf. (24.4). We shall need the components of the Fourier 
expansion of this operator with respect to the coordinates : 


Âk = f A(r) et d3x 
=F ew Ets, (86.9) 


The change to the time-dependent (Heisenberg) operator is made by the gen- 
eral rule 
A(t, r) = exp (iAt/h) A(r) exp (—iAt/h),' (86.10) 


where Å is the Hamiltonian of the system. This operator may be represented by 
the expressions; (86.8) and (86.9) with r, replaced by f(t), the Heisenberg oper- 
ators of the particle coordinates. 

According to the basic principles of statistical physics, the averaging ¢.. .) 
can be variously interpreted, according to the thermodynamic variables in terms 
of which the result is to be expressed. For example, if the function ø is defined 
for given total energy and number of particles in the system, the averaging is 
taken with respect to a definite (mth) stationary state, i.e. from the appropriate 
diagonal matrix element. For a homogeneous system (a liquid), the dependence 
of the matrix elements of the operator dA(t, r) on the time and coordinates is 
given by 

(m| S(t, r) |) = (m| 6A(0)|/) exp [i(Omt—Km-¥)], (86.11) 
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which is exactly analogous to (8.4); the right-hand side contains the matrix ele- 
ment of the Schrödinger operator 6/(r) taken at r = 0. Using this formula, we 


write 


no(t,r) = $, (m| 6A(t, r1) | 1) (/| 6A(t2, re) |m) 
i 


= Y Km] 6A(0) [J |? exp [i(@mut —Knut)]. 
l 


The Fourier transform of this function is 


fio(m, k) = (21) F |(m| 8A(O)| 1) |2 8o — orm) Òk — kim). (86.12) 
l 


The summation in these formulae is over all states of the system with a given 
number N, of particles, since the operator ôñ does not affect this number. 

If, however, we wish to express the form factor in terms of the temperature 
and chemical potential of the liquid, the expression (86.12) must also be aver- 
aged over the Gibbs distribution: 


no(o, k) = (27)t 2 exp (n) \(na| 6n(0)| D [ÔC — 1m) 6(k—kyn); 


(86.13) 


N, = N,, in each term of the sum. With a similar formula for o(—@, —k) = 
o(—w, k), interchanging the summation indices / and m and putting in the expo- 
nential factor E, = E,,+fw,, = E,,+tiw (because of the delta function), we 
find 

o(—, k) = o(a, k) e~*/T (86.14) 


and then, by (86.6), 
(w, k) = 4 (1+e7*/T) o(o, k). (86.15) 


It follows from (86.13) or (86.12) that o(@, k) = 0 for all values of the argu- 
ments. From (86.14) we have at zero temperature 


o(o,k)=0 for w<0, T=0. (86.16) 


In the macroscopic limit (N and V - œ fora fixed ratio N/V), the “palisade” 
of delta functions in (86.13) is smoothed into a continuous function, but the 
delta-function peaks in o(@, k) remain for values w = w(k) corresponding to 
non-decaying elementary excitations, as follows from arguments similar to 
those in §8. Such peaks occur, however, only for excitations without change in 
the number of particles.‘ 


t For example, in a Fermi liquid o(@, k) has a delta-function singularity at w = kup (up 
being the velocity of zero sound), but does not have such singularities corresponding to the 
fermion branch of the spectrum: see §91. 


$ 87 Dynamic Form Factor of a Liquid 363 


We shall show how the form factor of a liquid may be related to quantities 
occurring in the general formulation of the fluctuation-dissipation theorem 
(D. Pines and P. Noziéres 1958). 

Let each particle in the liquid be subject to an external field which gives it the 
potential energy U(t,r). Then the perturbation operator acting on the whole 
liquid is 

V(t) = | A(t, r) U(t, r) ax. (86.17) 


Taking a Fourier expansion of all the quantities here with respect to time, we can 
represent the response of the system, i.e. the mean value of the density change 
caused by the perturbation, as 


SĀ, r1) =— f a(@, [r1—r21) U(@, r2) d°x2, (86.18) 


where the function «(w, r) acts as a generalized susceptibility. The time Fourier 
component of the correlation function 6(t, r) is, in the notation of the fluctua- 
tion-dissipation theorem, 


nG(w, r) = (6n(r1) 6n(r2)).. r = rı1— r2. 


According to that theorem, this function is expressed in terms of the generalized 
susceptibility by 
(w, r) = Å coth (fiw/2T) im «(a, r). (86.19) 


A similar formula gives the coordinate Fourier component 6(a, k) in terms of 
a(w, k), and then from (86.15) we find the dynamic form factor 


2h 


1 uS eheo/T 


ño (œ, k) = im æ(w, k). (86.20) 


The importance of these formulae is due mainly to the fact that they establish 
a relation between the dynamic form factor and a function with known genera! 
analytical properties (as regards the variable w); for the function a(w, k) these 


properties are described in Part 1,§123. They also allow the use, in calculating 
the form factor, of the general formula (cf. (75.11)) whereby 


oo 


x(w, k) = ~ [ | eilot—k) (A(t, r) A(O, 0)— A(O, 0) A(t, r)) dt dex. (86.21) 
“5 

Expressing the density operators in terms of the y operators (n = ¥+¥), we 

can write this as a two-particle Green’s function, which may be calculated by 

means of the diagram technique. 
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PROBLEM 


Express in terms of the dynamic form factor the probability of inelastic scattering of slow 
neutrons in a liquid consisting of identical atoms (G. Placzek 1952). 


SOLUTION. According to the pseudo-potential method (see QM, §151), the scattering of 
slow neutrons may be described as the result of interaction with a potential energy 


U(r) = (2n*ht/M) af(r), (1) 


where A(r) is the density operator (86.8), M the reduced mass of the atom and the neutron, 
and a the slow neutron scattering length for a single atom, i.c. minus the limit of the scattering 
amplitude. The transition probability from an initial state 7 of the liquid + neutron system to 
a final state fin a range dv,is 


f Up(t) dt | dyy; (2) 


| 1 
dwy = EJ 
see QM, (40.5). For non-diagonal matrix elements Uy in (1), 62 may be written in place of A. 
The wave function of the initial state of the neutron (with momentum p and energy £) is nor- 
malized to one particle in the volume V, and that of the final state (momentum p’ and energy 
g’) by the delta function of p/2z. Then dv;= d3p’/(27h)?, and the perturbation matrix element 
is 

A = i(k-r—wt) J3 
Unt) = = ior > -l Stig(t, r) e d?x, 


where 4k = p- p’, iw = e — e’, and ĉng(t, r) is the matrix element with respect to the wave func- 
tions of the liquid. We substitute this expression in dw, and sum the transition probability 
over all possible final states of the liquid. The squared modulus of the integral is written as 
a double integral (over dt dt’ dx d3x’), and we use the fact that 


2, Ony(t, r) ng(t', r’)* = 2 Ont’, r’) Ony(t, r) 


= (il A(t’, r’) ACE, r) lÒ 
= no(t’—t,r’—r); 


a is expressed as a function of the total energy of the liquid in the state i. The integration over 
d(t’— t)d*(x’ — x) gives o(w, k), and a further integration (say, over dt d®x) gives just the volume 
Vand the total time interval ¢.;Omitting the factor t, we obtain as the scattering probability per 
unit time 

_ 4x he 


= Sqr iia’o'(a, hj); 


ae 6) 


This expression remains valid, of course, after being averaged over the Gibbs distribution, i.e. 
when the form factor is expressed in terms of the temperature. 

The property (86.16) of the form factor, as applied to the scattering of neutrons, expresses 
the fact that at T = 0 a liquid can only gain energy, not lose it. The relation (86.14) expresses 
the principle of detailed balancing, since scattering processes with energy and momentum 
transfer (w, k) and (— œw, — k) are inverse to each other. 


§ 87. Summation rules for the form factor 


The dynamic form factor satisfies certain integral (with respect to the fre- 
quencies w) relations called summation rules. 

The derivation of one of these is based on the commutation rule between the 
operators ñ,(t) and 7,(t). The commutator of Heisenberg operators taken at the 
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same instant is the same as that of the Schrödinger operators ñ, and 7,. The oper- 
ator ñy is determined by (86.9), and the required commutator is 


kt — nth, = —(ihi/m) kN, (87.1) 


where m is the mass of a liquid particle.‘ 
We start from the expression for the component of the Fourier expansion of 
the function o(t, r) with respect to the coordinates only: 


not, k) = f e~ik-m -rX S(t, rı) ON (t2, r2) L(x — x2). 
Since the integrand depends only on rı—rz, we replace the integration over 


d3(x1— x2) by one over d°x, d3x2/V; carrying out this integration within the 
averaging, we obtain 


a(t, k) = (1/N) (oft) OA_4(t2)). (87.2) 


We calculate the derivative Oo(t, k)/Ot at t = 0. Since a(t, k) depends only on 
the difference t = t, — tə, 





dolt, k) _ 1 (80 a0 
or >. 2 (a a) 


and, after substitution of (87.2), 
Ba(t, k)/Ot = (1/2N) (6m t1) 6A_1(t2) — Öñu(t1) öh (t2). 


Fach of the two terms here depends only on the absolute value of the vector k, 
and we therefore replace k by —k in the second term. Then, putting tı = te, and 
noting that #_, = nj, we find that the difference in the angle brackets is equal 
to the commutator (87.1). Hence 


[Oo(t, k)/Ot};.0 = —(if/2m)k?. 


On the other hand, expressing o(t,k) as a Fourier integral with respect to 
frequencies, we have 
a ee dw F dw 
= ]— miot = = —] — 
[Oo(t, k)/Ot),.0 | T f e`ietolw, k) On | i | wo(w, k) Fr" 


t=0 


90 —oo 


Comparison of the two expressions for the derivative gives the required relation 


r 2 
f wo(w, k) oe = a (87.3) 


t The calculation of this commutator is the same as the calculation used in QM §149 in 


deriving the summation rule (149.5); the number of electrons Z is here replaced by the total 
number of liquid particles N. 
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(G. Placzek 1952). It must be emphasized that this is valid for all k. When the 
classical limit (4 — 0) of this expression is taken, the integral on the left must 
be written as 

i dey 

| oloto, D-ou, ty) 2 


v 


0 


and we must put, in accordance with (86.14), 
a(w, k)—o(—, k) ~ (fw/T) olw, k). 


The factor fi then cancels on each side, leaving 


F dw 

2 2 = Tk 
f w°o(w, k) on Tk? /2m. 
0 


We can apply formula (87.3) to a Bose liquid at T = 0, and consider the range 
of small k. As k — 0, the principal contribution to the integral comes from the 
delta-function peak in the form factor o(w, k), which arises in (86.13) from tran- 
sitions creating one phonon; since there are no phonons in the ground state of the 
liquid, there are no transitions annihilating a phonon at T = 0. This term has 
the form 4ô(w — uk), where fuk is the phonon energy (u is the velocity of sound). 
Substituting it as o(œ, k) in (87.3), we find the coefficient 4, and the result is 


o(w, k) = (hk /mu) 6(w—uk). (87.4) 


Integration of this according to formula (86.7) gives the static form factor 
o(k) = fik/2mu (87.5) 


(R. P. Feynman 1954). Since this formula relates to the range of small k, its 
Fourier transformation gives the asymptotic expression for the correlation func- 


tion at larger: 
v(r) = —h/2x*mur'; (87.6) 


this may be verified by means of the integral given in the last footnote to §83. 
At T = 0, formula (87.6) is valid at arbitrarily large distances. At low but finite 
temperatures, it is valid up to distances r ~ hu/T, where the fluctuations cease 
to be purely quantum ones. At still greater distances, formula (87.6) is replaced 


t Formula (87.5), written in the form o(k) = #°k*/2me(k), where e(k) is the quasi-particle 
energy, is strictly valid only as k — 0. As k increases, there are increasingly important contri- 
butions to o(k) from transitions creating several quasi-particles. If this contribution is never- 
theless neglected, we may suppose that the formula gives the relation between the form factor 
and the energy of the quasi-particles in a Bose liquid. The maximum of a at k ~ 1/a (where a 
is the interatomic distance in the liquid) corresponds to the “roton” minimum on the curve 
of e(k). 
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by an exponential decrease (if the contribution of the van der Waals forces is 
neglected; see §83).' 

Another summation rule can be obtained from the relation established in 
§86 between the form factor and a generalized susceptibility «(w, k). This rela- 
tion is given by (86.20), which for T = 0(andw > 0) becomes 


fio(w, k) = 2h im a(c, k). (87.7) 
According to the Kramers—Kronig formulae (see Part 1, (123.15)), 


re « (w, k) -/p [E ae. 


Putting here w = 0, and noting that «(0, k) is real,* we get 
im a(o, k) 5- (87.8) 


In the limit k — 0, 
a(0, k + 0) = (0n/Ou)r—0 
= n(0n/OP)r-o. _ (87.9) 


This follows because, in a static weak field U varying slowly in space, the equi- 
librium condition is u+ U = constant, so that the switching-on of the external 
field is equivalent to changing the chemical potential by — U. In the limit 
k - 0, we therefore have from (86.18) 
én = —(On/Opu) U 

~ —U fa(0, r1—re) d? (x1— x2) 

= —Ua(0,k = 0), 
whence (87.9) follows. 

Thus, combining the formulae (87.7)-(87.9), we find the following summation 

rule for the form factor of a liquid at T = 0: 


oo 


1 dw On 


0 
(D. Pines and P. Noziéres 1958). 


t The correlation function (87.6) is negative (corresponding to repulsion between particles), 
unlike that of an ideal Bose gas, which is positive (see Part 1,§117). In this connection it may 
be recalled (§25) that in a slightly non-ideal Bose gas the energy spectrum has the phonon 
form only at k «< mu/f (with A/mu >> a). The corresponding distances r ~ 1/k >> h/mu, so 
that in the transition to an ideal gas (u > 0) the range of applicability of (87.6) moves away 
to infinity. 

+ The quantity a(w = 0, r) is real because of the general properties of the generalized sus- 
ceptibility. Then the Fourier component «(w = 0, k) is real because a(w, r) is an even func- 
tion ofr. 
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PROBLEMS 


PROBLEM 1. Find the correlation function v(r) in a Bose liquid at distances r = Au/T and 
temperatures T & 7). 


SOLUTION. The required correlation function is determined by the form factor for 
k ~ 1/r S T/hu « 1/a, for which the liquid has a phonon energy spectrum. When T = 0, 
o(w, k) contains a term in 6(@+ku) corresponding to phonon absorption, as wel las one in 
6(w — ku) corresponding to phonon emission. The coefficients in these terms can be found 
from (86.14) and (87.3): 


a(w, k) = zik [1 — e7 AeMIT]—1 (6(w— ku) + e-r? 6(w + kuw}. (1) 


Integration of this expression gives 


hk Aku 
o(k) = Smu coth 57 (2) 
and hence 
d?k 
= ik. OLE 
v(r) f rah) as 
A r seis hku 
mee fe PCO ag de 


By completing the contour of integration with respect to k by an infinite semicircle in the upper 
half-plane of complex k, we reduce the integral to a sum of residues at the poles (which lie on the 
imaginary axis). For r >> fu/T, the main contribution to the integral comes from the residue 
at the pole #ku/2T = in: 
2nT3 2nTr 
O = nair O (ia) c 
With the condition aT/hu « 1, the characteristic decay length for the function »(r) is much 
greater than the interatomic distances over which effects decay that are due to the direct inter- 
action between atoms. In formula (3), A is essentially involved, and the correlation described 
by this formula is therefore a quantum effect. In the derivation, the contribution from the 
van der Waals forces has been neglected. It follows from the results of §83 that this contribu- 
tion has a power-law form and predominates at sufficiently large distances. The distances at 
which (3) is replaced by (83.16) depend on the specific relation between the coefficients, but a 
range of applicability of formula (3) always exists at sufficiently low temperatures, since at the 
limit of this range with r ~ Au/T, according to (3) v oc T4, and according to (83.16) » oc 77. 


PROBLEM 2. Find the limiting form (at large distances) of the correlation function for the 
fluctuations of the condensate wave function in a Bose superfluid (P. C. Hohenberg and P. C. 
Martin 1965). 


SOLUTION. In the long-wave limit, the strongest fluctuations are those of the condensate 
wave function ®, since they involve only the relatively small energy of the macroscopic super- 
fluid motion. The corresponding contribution to the total thermodynamic potential 2 of the 
liquid (in a volume V with given T and u) is 


1 š Ae, 
6Q = fzer dV = sat | (voy dV. 


Expressing 6® as a Fourier series, 


ôD = > ôD, efkr, 6®_, = dB} ’ 
k 
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we obtain 
2 


i 
6Q = at evy k? 18®,!? 





and hence the mean square fluctuations 


(189. = Tm?/Vho,k?; (4) 


the calculations are exactly similar to those in Part 1, § 146. The contribution of these fluctu- 
ations to the one-time correlation function 


G(r) = (820) E0) 


G(r) ~= nDO) Dr) = Tnom?/A4rnho,r. 


Thus G(r) decreases by a power law at large distances. The contribution frcm fluctuations 
of the condensate density no decreases exponentially there. The two contributions are com- 
parable at r ~ r,; at distances r « r, they jointly follow (at temperatures near the A-point) 
the law 


G(r) œ rot, (6) 


where ¢ is the appropriate critical index. The correlation radius r, may be defined as the dis- 
tance at which the asymptotic form (5) is replaced by (6): 


r; © o,fno. 


With the critical indices B and v, which describe the temperature dependence of n, and r, 
according to (28.1) and (28.3), we find that 


Q; © (T3 z T)», 


With the known relations between the critical indices a, B, v, € (see Part 1, §§ 148, 149), we can 
easily see that this result agrees with (28.4). 


§ 88. Hydrodynamic fluctuations 


In the preceding sections, we have considered density fluctuations in a liquid 
for any frequencies w and wave vectors k. Here, of course, the actual form of 
the correlation function could not be found in the general case. However, this 
can be done in the hydrodynamic limit, where the wavelength of the fluctuations 
is large compared with the characteristic microscopic dimensions (interatomic 
distances in a liquid, mean free path in a gas). . 

The calculation of the one-time correlation functions of fluctuations of den- 
sity, temperature, velocity etc. in a liquid at rest calls for no special study: these 
fluctuations (in the classical or non-quantum limit) are described by the usual 
thermodynamic formulae, which are valid for any medium in thermal equilib- 
rium. The correlations between fluctuations at the same time at different points 
in space are propagated to distances of the order of interatomic distances (here 
we neglect the weak long-range van der Waals forces). But, in hydrodynamics, 
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such distances are regarded as infinitesimal. In the hydrodynamic limit, there- 
fore, the fluctuations at the same time at different points are uncorrelated. This 
statement follows formally from the additivity of a thermodynamic quantity, 
the minimum work R,,, needed to cause the fluctuation. Since the probability 
of the fluctuation is proportional to exp(— Rmin/T), by representing Rmin as a 
sum of terms pertaining to various physically infinitesimal volumes we find 
that the probabilities of fluctuations in these volumes are independent of one 
another. 

Using this independence, we can immediately rewrite the known formulae 
(see Part 1, §112) for the mean square fluctuations of the thermodynamic 
quantities at a given point in space as formulae for the correlation functions. 
For example, from the formula ((6T)?) = T?/oc,V for the temperature fluctu- 
ations in the volume V (where ọ is the density, and c, the specific heat per unit 
mass of the medium), we first write 


(6T(ta)dT(to)) = (T?/eCVa) ab, 


where the fluctuations relate to two small volumes V, and V,. Then, as the 
volumes tend to zero, we obtain? 


(8T (r1) ÒT (r2)) = (T2/0cy) 6(t1—F2). (88.1) 


Similarly, we have the following formulae for the fluctuations of other ther- 
modynamic quantities : 


(60(r1) 60(r2)) = eT(80/OP)r (rı—r2), (88.2) 
(6P(r1) 6P(r2)) = eT(OP/00)s d(r1—F2) 

= oTu?ô(rı—r2), (88.3) 

(6s(r1) 6s(r2)) = (cp/0) ô(rı—r2), (88.4) 


where P is the pressure and s the entropy per unit mass of the medium; the 
fluctuations of the pairs ọ, T and P, s are independent. We can also write a for- 
mula for the fluctuations of the macroscopic velocity v of the liquid (which is 
zero in equilibrium): 


(ôvi(r1) 6v¢(t2)) = (T/e) dix Ô(rı—r2). (88.5) 


A problem specific to hydrodynamics is that of the time correlations of the 
fluctuations, as is that of fluctuations in a moving liquid. The solution of these 


t This and subsequent formulae for one-time correlations in gases are valid for fluctuations 
with wavelengths large only in comparison with intermolecular distances but not necessarily 
with the mean free path. The latter condition is, however, required for different-time correla- 
tion functions in the hydrodynamic approximation (since the microscopic mechanism of prop- 
agation of perturbations in gases is determined by the mean free path of the particles). 
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problems calls for the consideration of dissipative processes (viscosity and 
thermal conduction) in the liquid. 

The construction of a general theory of fluctuation phenomena in hydrody- 
namics amounts to setting up the “equations of motion” for. the fluctuat ng 
quantities. This can be done by adding the appropriate terms to the hydrody- 
namic equations (L. D. Landau and E. M. Lifshitz 1957). 

The equations of hydrodynamics, written in the form 





00/Ot+div (ov) = 0, (88.6) 
Ov; Ov; = oP oik 
e prte Oxx = ~ Ox; Ox: : Sa 
Os _ 1 , (0% Ok i 
oT (ar +7) = zlik loa, +x) —div q, (88.8) 


with no specific form of the stress tensor o,, and the heat flux vector q, simply 
express the conservation of mass, momentum and energy. In this form they 
are therefore valid for any motion, including fluctuational changes in the state 
of a liquid. Then 0, P,v, ... are to be interpreted as the sum of the values of 
Go, Po, Yo, ... in the basic motion and their fluctuations 60, ôP, dv, ... (the 
equations can, of course, always be linearized with respect to the latter). 

The usual expressions for the stress tensor and the heat flux relate them 
respectively to the velocity gradients and the temperature gradient. When there 
are fluctuations in a liquid, there are also spontaneous local stresses and heat 
fluxes unconnected with these gradients; we denote these “random” quanti- 
ties by s, and g. Then 


, _ (00; Ov, 2 5 <a . 
Oik =N Cai a 3 6ix div v) +l6j, div VH Sik, (88.9) 
q = —xyT+8g, (88.10) 


where 7 and ¢ are the viscosity coefficients, and x the thermal conductivity. 

The problem now is to establish the properties of s, and g as regards their 
correlation functions. For simplicity, the arguments will be given for the nor- 
mal case in hydrodynamics, that of non-quantum fluctuations; this means that 
the frequencies of the fluctuations are assumed to satisfy the condition iw « T. 
The viscosities and the thermal conductivity are assumed non-dispersive, i.e. 
independent of the fluctuation frequency. 

In the general theory of fluctuations given in Part 1, §§119-122, a discrete 
sequence of fluctuating quantities x1, x2, ..., is considered, whereas here we 
have a continuous sequence, the values of 9, P, ... at each point in the liquid. 
This unimportant difficulty is avoided by dividing the volume of the body into 
small but finite portions AV and considering certain mean values of the quan- 
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tities in each portion; the change to infinitesimal volume elements is made in 


the final formulae. 
We shall take formulae (88.9) and (88.10) as the equations 


Ža = — 9} YabXo + yo (88.11) 
b 


of the general theory of quasi-stationary fluctuations; see Part 1, (122.20). 
The quantities x, are taken to be the components of the tensor o,, and the 
vector q in each portion AV. Then the quantities y, are S; and g: 


Xa => Oik, Gis | (88 12) 
Ya > Siks 8i. 


The significance of the thermodynamically conjugate quantities X, is deter- 
mined by means of the formula for the rate of change of the total entropy S 
of the liquid. We find in the usual way (cf. FM, §49) from (88.8)-(88.10) 


Oik [0V 40% q.VT 
$= f {sr ( (a +t) S| dv. (88.13) 


Replacing this integral by a sum over the portions AV and then comparing 
with the expression 


== — J ŽaXa, 
we find that 
1 Ov; Ov; 1 OT 
¥< Sorlent ae) MW, aa SV (88.14) 


It is now easy to find the coefficients y,,, which immediately give the required 
correlations by 
(Yaltı) Yo(t2)) = (Vab + Vba) O(t1— t2); (88.15) 
see Part 1, (122.21a). 
First of all, we note that in formulae (88.9) and (88.10) there are no terms 
which would relate o,, to the temperature gradient, or q to the velocity gra- 
dients. This means that the corresponding coefficients y,, = 0, and from (88.15) 


(siti, r1) Bi(t2, r2)) = 0, (88.16) 


i.e. the values of s and g are uncorrelated. 

Next, the coefficients relating the values of q; to those of (AV /T*) OT/Ox;, are 
zero, if these quantities are taken in different portions AV, and are yy, = 
= xI6,,/AV if taken in the same AV. With these values of y,,, we obtain 
from (88.15), after taking the limit 4V — 0, 


(gi(t, rı) gr(te, r2)) = 2xT?6;,0(¥1—Fe2) ô(tı— f2). (88.17) 
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Similarly we obtain formulae for the correlation functions of the random 
Stress tensor: 

(Sit{t15¥1) Stm(to,¥2)) = 2T[N(Snðkm + SimOx) + (S— $ N) ix51m) (81 — r2) 6(t1— t2). 
(88.18) 

Formulae (88.16)-(88.18) in principle solve this problem of calculating the 
hydrodynamic fluctuations in any particular case. The solution procedure is as 
follows. Regarding s, and g as given functions of coordinates and time, we 
formally solve the linearized equations (88.6)-(88.8) for ĉo, dv, ..., with the 
necessary hydrodynamic boundary conditions. This gives the quantities ex- 
pressed as linear functionals of Sẹ and g. Correspondingly, any quantity 
quadratic in 60, dv, ... is expressed in terms of quadratic functionals of sy, 
and g, and the mean value is calculated from (88.16)-(88.18); the auxiliary 
quantities s,, and g do not appear in the result. 

We can also write out formulae (88.16)-(88.18) in Fourier components with 
respect to frequency, and we shall do this in a form which generalizes them to 
the case of quantum fluctuations. According to the general rules of the fluctua- 
tion-dissipation theorem, such a generalization is obtained by including an 
extra factor (fiw/2T) coth (iw/2T) (which is unity in the classical limit iw « T).. 
In the presence of dispersion of the viscosity and thermal conductivity, the 
quantities 7, ¢ and x are complex functions of the frequency; in the formulae 
for the fluctuations, they are replaced by the real parts of those functions: 


(PeP) = 0, (88.19) 
(gg), = Siôl — r2) hwT coth (fiw/2T) re x(w), (88.20) 


(sPs@), = fiwd(r1—re) coth (žw/2T)X 
X[(Sdkm+ OimOxt— F Ôikôim) re n(o) +Ò ixd1m re C(w)}. (88.21) 


§ 89. Hydrodynamic fluctuations in an infinite medium 


In this section we shall consider hydrodynamic fluctuations in an infinite 
liquid at rest. This problem can, of course, be solved by the method given in 
§88. Here, however, we shall use a different method, and thus exemplify an 
alternative way of solving problems of hydrodynamic fluctuations. 

This other way employs the general theory of quasi-stationary fluctuations 
in its earlier stage, before the introduction-of the random forces. The relevant 
general formulae are as follows (see Part 1, §122). 

Let 

Ša = — È AabXb (89.1) 
q 


be the macroscopic “equations of motion” for the set of quantities x,(t) which 
describe the non-equilibrium state of the system (in equilibrium, all the x, = 0). 
These equations are valid if the x, are large compared with their mean fluc- 
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tuations (but also so small that the equations of motion may be linearized). 
We can then say that similar equations are satisfied (when ż > 0) by the corre- 
lation functions of the fluctuations: 


= xal) xA0)) = -Y Aaa XA), t > 0 (89.2) 


The initial condition for these is formed by the equations 
[(xa(t) xc(0))]r=40 = (XaXe)s (89.3) 


where (x,x,) is the one-time correlation function, assumed known. In the range 
t < 0, the correlation functions are continued by the rule 


(Xa(t) xe(0)) = +(xa(— t) xe(0)), (89.4) 


the upper sign relating to the case where x, and x, are both even or both odd 
under time reversal, and the lower sign to the case where one is even and the 
other odd. The solution of equation (89.2) with the condition (89.3) is obtained 
by means of a one-sided Fourier transformation: multiplying the equation by 
e' and integrating with respect to t from 0 to oo (with integration by parts on 
the left-hand side), we obtain the equations 


—iw(xax)Gr? = — £ Aab(XbXe)G + (XaXe) (89.5) 
Tr 
for the quantities (functions of frequency) 
(xax? = f etatai) x(O) dt. (89.6) 
0 


The ordinary Fourier components of the correlation function are expressed in 
terms of the quantities (89.6) by 


(XaXb)o = f elmt( x(t) xp(0)) dt 


= (xaxs)GO+[(xax)G?]" 
= (XaXb) G + (xexa)2; (89.7) 


where the signs + correspond to those in (89.4). 

Proceeding to the stated problem of fluctuations in a liquid at rest, we first 
linearize the hydrodynamic equations (88.6)-(88.8) with o;, and q from (88.9) 
and (88.10) (without the final terms). Putting @ = e@0+60, v = ôv, ..., and 
Omitting the non-linear terms we find 


060 "EISS 
eTii ọ div v = 0, (89.8) 
o Ov/Ot = —VbP+nav+(l+4n) V div v, (89.9) 
OS a are (39.10) 


“Or oT 
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the suffix 0 to the constant quantities go etc. is omitted after the linearization. 
In equations (89.8)-(89.10) it will be convenient to divide the velocity at once 
into the potential (“longitudinal”) and rotational (“transverse”) parts v and 
v defined by 

v= Oty, (89.11) 


div v™ = 0, curl v® = 0. 


In (89.8) only the longitudinal velocity occurs: 


06g 


aE +o div ¥ = 0, (89.12) 


and (89.9) separates into the two equations 





ov) N 
een ) 
a = Dp OM: (39.13) 
D ` 
o z =—yôP+ (+37) 7 div vD., (89.14) 


The equation for the transverse velocities is independent of the other equa- 
tions. Accordingly, we also have one equation for the correlation function of its 
fluctuations: 


E(t r) v(0, 0))— valot, r) 00, 0)) = 0, - (89.15) 


where v = n/o is the kinematic viscosity. Taking the one-sided Fourier trans- 
form, we obtain 


— io( vfr) EAO) vaP) VOE = P) O), 


where the right-hand side is the one-time correlation function; or, with the 
Fourier components with respect to the coordinates, 


(WP Pok = (01? OP )/ (Pk? — io). 


The one-time correlation function of the velocity fluctuations is given by (88.5); 
changing to Fourier components and separating the transverse part, we have 


(vf of), = (T/0) (On — kik /k?). (89.16) 
Substituting in the preceding formula, we have finally* 
i pk? 
(vf) V ()) ok = 2 re (of? vG = = (ou-e) et ki . (89.17) 


For the other variables we have a system of coupled equations (89.10), 
(89.12), (89.14). This becomes simpler, however, in the limiting cases of high 


t It is easy to see that, on integrating (89.17) with respect to w/2:1, we return to the one-time 
correlation function, as is to be expected. 
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or low frequencies. The reason is that the perturbations of longitudinal velocity 
and pressure are propagated in the liquid with the velocity of sound u, and those 
of entropy according to the equation of thermal conduction. The latter 
mechanism requires a time ~ 1/yk? for propagation of the perturbation toa 
distance ~ 1/k (where y = x/oc, is the thermometric conductivity of the 
medium). Hence, for frequencies that satisfy (with a given value of the wave 


number) the condition 
yk? «æ w ~ ku, (89.18) 


we may suppose that only v and P fluctuate at constant entropy. On the 


other hand, if 
yk? ~w< ku, (89.19) 


there are isobaric fluctuations of entropy.' 

Let us first consider the high-frequency range (89.18) and determine the 
fluctuations of pressure, for example. 

Equation (89.14), written for the correlation functions, has the form 


- (v(t, r) ôP (0, 0)) = — grad (ôP (t, r) ôP(0, 0)) + 


pi ( 4 $ n) grad div (v(t, r) ôP(0, 0)), 


and the initial condition for it is the vanishing of the one-time correlation of 
v and ôP. With a one-sided Fourier transformation with respect to time and 
a complete transformation with respect to coordinates, we hence have 


—iwe(v SP) = —ik(6PAGP—(6+47) kk. VSP). (89.20) 
Next, in equation (89.12) we write 


_ (ĉe 3o 2 sp 2 (Or 
ae (ap). ôP+ (a) 09 = ga OP 5 (ap) & 


and 06s/0t is expressed by means of (89.10) in the form 


ods x (P 


oF ap), oP: 


the term in Ads on the right is neglected in comparison with 06s/dt, since 
yk? << w. This leads to the equation 


1 ðP xo (zy 


iv yO = 0, 
5P AôP-+ o div v 0 


— — a a 


s 


t The inequality yk? < ku is always satisfied in the hydrodynamic region. For example, in 
gases u ~ Vr and y ~ vrl, where vz is the mean thermal velocity of the particles and / their 
mean free path. Hence the inequality yk « u is equivalent to the necessary condition kl « 1. 
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The corresponding equation for the correlation functions is again obtained by 
replacing ôP and v® by (6P(t, r) 6P(0, 0)) and (v(t, r) ôP(0, 0)) respectively; 
the initial condition is (88.3). After the Fourier transformations, this equation 
becomes 

| io  k?xo (3 


-rt T F). i (SPDS + io(k .vOSP)SY = eT. (89.21) 


From the two equations (89.20) and (89.21) we can find 
k20TuA(i+ 2ypra/uk?) 





(+) — eS NE Ay AAA 
(6P?).4, = 2 re (PYLE 2 re coo? — a2 + Ziou (89.22) 
where 
k2 4 xu?o? / OT 
” = Dou Dea T (ar) | a 


is the coefficient of absorption of sound in the medium (see FM, §77), and yr 
is the part of this due to thermal conduction. The final result for frequencies 
near the values œ = +ku, where the fluctuations are especially great, is 


(Pok = Tiy (o F ku)? +u?y?]. (89.24) 


This formula is valid when |œ Fku| < uy. 

In the low-frequency range (89.19) it is sufficient, as already mentioned, to 
consider the fluctuations of entropy, neglecting those of pressure. This means 
that in (89.10) we can put 


T ~ (OT/0s)p ôs = (T/cp) Os, 


the specific heat c, being taken per unit mass. The required correlation function 
therefore satisfies an equation of the same type as (89.15), and the initial 
condition is (88.4). The result is 


.2cp xk 


(ôs? Jak = 


PROBLEMS 


PROBLEM 1. Find the correlation function of the fluctuations of the number of solute par- 
ticles in a weak solution. 


SoLuTion. The number density n of solute particles satisfies the diffusion equation 


On/Ot = D An, 


t It may be recalled (see FM, §77) that the hydrodynamic sound-absorption coefficient is 
always small in gases (the inequality y < k necessarily follows from the condition kl « 1), and 
is small in liquids where there is no significant dispersion of sound. 
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where D is the diffusion coefficient. In a weak solution, the simultaneous values of the density 
at different points are uncorrelated (just as there is no one-time correlation for the density 
of an ideal gas); the one-time correlation function is therefore 


<6n(r,) ÔN) = BÔ — r3). 
Similarly to (89.25), we find 
(nok = 27k?D/(w?+ k*D?*). 
Here, thermal diffusion is neglected, and the fluctuations of n may therefore be treated 
independently of those of temperature. 


PROBLEM 2. Find the correlation function of pressure fluctuations in a liquid having a large 
and dispersive second viscosity €(w) (due to slow relaxation of some parameter). 


SoLuTION. The presence of slow relaxation processes causes a second viscosity of the form 
To 2 2 
w) = ———— (uso — u 


where 7 is the relaxation time, uọ the equilibrium velocity of sound, and u,, the velocity of 
sound for a constant value of the relaxation parameter; see FM, §78. Equations (89.20) and 
(89.21), and therefore (89.22), are valid also in the presence of dispersion. Putting ¢ = (o) 
and neglecting the terms arising from 7 and x, we obtain after some calculation 


2T rous(uz, — up) 


(OP*)uk = (uz — o/h)? +? t?(u2, — w?/k?)? i 


§ 90. Operator expressions for the transport coefficients 


The formulae (88.20) and (88.21) may be viewed differently by reading them 
“from right to left”, i.e. by regarding them as expressions for the thermal 
conductivity and the viscosity. The correlation functions on the left may then 
be expressed, in accordance with their definition, in terms of the operators of 
certain quantities having microscopic significance; we thereby obtain the trans- 
port coefficients of the liquid, expressed in terms of these operators. 

First of all, we must note that the absence of any correlation between the 
fluctuations of the “random” energy and momentum fluxes at different points 
in space (the delta function 6(r1—r2) in (88.20) and (88.21)) is a consequence of 
the hydrodynamic approximation, which is valid only for small values of the 
wave vector. In order to express this condition explicitly, we write the formulae 
in Fourier components with respect to the spatial coordinates (which amounts 
to substituting unity for the factors ô(rı—rə)) and take the limit k —> 0. For 
example, formula (88.20) contracted with respect to the suffixes i and k, 


(g® .g®),, = 36(r1—re) hwT coth (hw/2T).re x(a), 
is written as 


= ho ge 
re x(@) = ok tanh ral jim. (27) cote (90.1) 


Itis easy to see that in this formula we can replace the “random” heat flux g by 
the total energy flux, which we shall denote by Q. The latter, as we know from 
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hydrodynamics, consists of the convective energy flux and the heat flux q: 
Q = (Fv?+w) ov+q ~ owv—x VT+8, (90.2) 


where w is the heat function per unit mass of the liquid; in the last expression, 
we omit the term containing a higher power of the fluctuation velocity v. For 
small k, however, the fluctuations of the actual physical quantities v, T, 0, etc., 
contain an extra power of k in comparison with those of the random fluxes, 
and therefore the fluctuations of g are the same as those of Q in the limit 
k — 0. This is immediately evident from the fact that the fluxes g and S, appear 
in the equation of motion (88.6)-(88.8) of the hydrodynamic fluctuations only as 
spatial derivatives, but the physical quantities mentioned occur also as time 
derivatives; when the Fourier components are taken, the latter quantities are 
therefore of order k/w relative to the former. 

Unlike g, the total energy flux Q has a direct mechanical significance and 
corresponds to a definite quantum-mechanical operator Q(t, r) expressible in 
terms of the operators of dynamic variables of particles in the medium. With 
the definition of the correlation functions in terms of the (Heisenberg) operators 
of the corresponding quantities, we thus arrive at the formula 


1 h 
re x(w) = ChoT tanh rx 


oo 


x lim i f elot—k Q(t, r) Ô(0, 0) + Ô(0, 0) Ô(r, r)) dt d8x (90.3) 
k= 0J 


(M. S. Green 1954). 

A more useful representation of the function x(w) is, however, obtained by 
means of a formula expressing the correlation function in terms of the commu- 
tator of the corresponding operators. 

If x,(r) and x,(r) are two fluctuating quantities (equal to zero in equilibrium 
and behaving in the same way under time reversal), their correlation function 
may be written, according to (76.1) and (75.11), as 


(x x), = coth oe f e'N [Ral(t, r1), (0, r2)]) dt, 


0 


where [...] denotes the commutator. Changing to a Fourier expansion with 
respect to the coordinates r = rı—r2, we obtain 


(XaXd)ok = coth $o., eff eot-k.1X[ 2t, r), 20, 0)]) dt d°x. (90.4) 
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Applying this formula to the correlation function (Q?),,, and substituting in 
(90.1), we have 


re x(w) = — lim re Í f el(ot—k XT Q(t, r), Q(0, 0)]) dt d?x. 
oT x 0 ; 

The two sides of this equation contain the real parts of functions of œ that 
tend to Zero as œw —> oo and have no singularity in the upper half-plane of the 
complex variable w. If the real parts of such functions are equal on the real 
axis of œ, it follows that the functions themselves are equal, and we arrive at the 
final formula 


oo 


a jl elot—k XT Qcr, r), Q(0, 0)]) dt d3x. (90.5) 
k— 0. 3 


a= 30T 





In order to derive the static value of the thermal conductivity, we must then also 
take the limit œw — 0. 

Similarly, formula (88.21) may be transformed into an operator expression 
for the viscosity coefficients. 

If we use the total momentum flux o, = —P6,,+0, (with o;, from (88.9)) 
then in the limit k — 0 the fluctuations of all terms except sy, become zero, 
and So in this limit we can replace the correlation function (S;4Sm)ok DY (FizF 1m) cok: 
The result is 


2 
n(o) (ndun + ÔimôkI — 3 bik ôm) +(e) ÖikÔim 


1 lim f f elot —k -1X [ôik(t, r), Gim(O, O)]) dt dx, (90.6) 
k — 0 ¿ 

where 6;,(t,r) is the momentum flux density operator (H. Mori 1958). Contract- 

ing this equation with respect to the pairs of suffixes i, k and /, m or i, l and 

k, m, we obtain separate expressions for 9¢ or for 10n +3¢ respectively. 


§ 91. Dynamic form factor of a Fermi liquid 


Formulae (87.4)-(87.6) for the form factor at T = 0 are not applicable to a 
Fermi liquid, since their derivation assumes the existence (for small œw and k) 
of only the phonon branch of the spectrum of elementary excitations. The 
hydrodynamic theory of fluctuations developed in §§88 and 89 is also inappli- 
cable to a Fermi liquid. It requires the fulfilment of the condition kl « 1 (where 
lis the quasi-particle mean free path), which is certainly not satisfied in a Fermi 
liquid, since / oc T~? and tends to infinity as T + 0. Hence the transport equa- 
tion must be used to calculate the form factor of a Fermi liquid. 
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Here it is convenient to start from equations (86.17)-(86.20), which give the 
relation between the form factor and the generalized susceptibility with respect 
to the action of a field U(t, r) on the liquid. In Fourier components also with 
respect to the coordinates, the definition (86.18) becomes 


bii(@, k) = —a(c, k) Usp- (91.1) 


We shall consider only the case T = 0. Then the dynamic form factor is ex- 
pressed in terms of (w, k) by 


(91.2) 


ñola, k) = | 2hima(w,k), w>0, 


0, w < 0. 


The density perturbation ôñ(w, k) is calculated by means of the transport 
equation, in which the collision integral may be neglected (as T — 0). These 
calculations only differ from those for zero sound in §4 by the addition of the 
term 

U(t, 1) = Uny ele 3-20 


in the quasi-particle energy. Correspondingly, the derivative 0e/Or (4.3) con- 
tains an added term 0U/Or = ikU, and on the left of the transport equation 
(4.8) there is a term 


—ikU .Ono/Op = ik.vUd(e— er). 
The solution of the transport equation is sought in the form 


dn(p) = 4n,.4(p) eM 7-9), 
(91.3) 





drax(B) = ôe- er) tl), n = Ip. 


This is the Fourier component of the perturbation of the quasi-particle momen- 
tum distribution. The required change in the density of the total number of 
quasi-particles (the number density of actual particles) is given by the integral 


(o, k) = | Srl ).2 d2p/(Qach) = — 5 | 1D Ume 


The definition of the function y(n) in (91.3) differs from that of »(m) in (4.9) as 
regards normalization: here, it is chosen so that formula (91.2) becomes 


fio(w, k) = im f y(n) do/4x, w>0. (91.4) 
For y(n) itself, we get the equation 
(o— vrk .n) y(n)— vrk .n f F(8) x(n’) do'/4n = —k.n.2p}/n?h?, (91.5) 


which differs on the right-hand side from (4.11). 
Equation (91.5) does not explicitly contain imaginary quantities. The pres- 
ence of an imaginary part of its solution y(n) is therefore due only to the pas- 
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Sages round poles in the integrals that arise during the process of solution. 
The rule for these passages depends on the requirement that the field U oc ei! 
applied to the system is applied adiabatically from t = — œ onwards; for this 
purpose, its frequency w must be replaced by w+ i0. 

The specific form of the solution depends on the form of the quasi-particle 
interaction function F(#). We shall illustrate the process of solution and the 
resulting properties for the simplest example, in which F = Fo, a constant. 

In this case, the solution of equation (91.5) has the form 


y(n) = Cork.n/(vpk.n—w—i0), (91.6) 


where C is a constant. This constant is determined by substituting (91.6) back 
into (91.5), which gives 


C41) = 2m*pr/n?h?, (91.7) 


I= k.n' vr do’ 
= k.n’vp—w—i0 ` 4n ` 


where 


The integrand depends only on the angle between n’ and k, and an obvious 
substitution gives 


1 
nae x dx | stl | (fisx, s<1, 
I(s) = = —— I-58 log s-i i+ 0, eat (91.8) 





~1 


where s = w/kv,; the imaginary part of the integral is determined by the rule 
(8.11). 

Substituting the function y(n) from (91.6)-(91.8) in (91.4), we get the dynamic 
form factor 
aes 2m* pr a I (s) 
— mk 1+ Fol(s) 


(A. A. Abrikosov and I. M. Khalatnikov 1958). According to (91.8), this is 
non-zero for s < 1, i.e. for all w < kvp. 

If Fo > 0, zero sound can be propagated in a Fermi liquid with a velocity uo 
determined by (4.15): 


fio(w, k) (91.9) 


1+Fol(so) = 0, So = uo/vF. 
For values of s close to so, the expression (91.9) becomes 
constant X im 1/(s— so); 


according to the comment made above, s = w/kvp is to be taken as s+i0. 
This means that o(w, k) contains also a delta-function term having the form 
constant x d(s—so), or 


o(w, k) = constant X kô(w— kuo). (91.10) 
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This term is the contribution to the form factor from the zero-sound branch of 
the energy spectrum of the Fermi liquid; it is exactly analogous to the phonon 
contribution (87.4) to the form factor of a Bose liquid. 

The existence of such a term does not, of course, depend on the assumption 
that Fis constant; it is a general property of a Fermi liquid in which zero sound 
can be propagated. Only the value of the constant coefficient in (91.10) depends 
on the law of interaction of quasi-particles. Equation (91.5), with zero on 
the right-hand side, is the equation of zero sound; the solution of the inhomo- 
geneous equation therefore has a pole at w/k = uo. 

It is clear from the form of (91.5) that its solution depends on the parameters 
w and k only through the ratio w/k. The dynamic form factor will therefore 
also be a function of this ratio. The static form factor 


o(k) = f o(w, k) go 


will consequently have the form 


o(k) = constant X k. (91.11) 


This means that the one-time spatial correlation function of the density fluc- 
tuations at T = 0 in a Fermi liquid obeys the law v(r) oc r~4, as in a Bose 
liquid. 

Lastly, we may note that the dynamic form factor of an ideal Fermi gas is 
obtained from (91.9) by taking the limit Fo — 0: 


olw, k) = m’a/nh’nk, 0 <w < kor. 
The static form factor is then 


kup 


o(k) = f o(%, nie 


Pu 


0 


Pk 
(Qh ’ 


in agreement with the result in Part 1,§117, Problem 1. 
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